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EDGAR RUBIN ’* 
1886-1951 


Edgar Rubin died on May 3, 1951 
after a protracted illness which was 
brought on in part by the hardships 
which he suffered in his flight to Swe- 
den during the German occupation of 
Denmark. He was born on September 
6, 1886, in Copenhagen, where from 
1904-11 he studied philosophy and psy- 
chology under Harald Hoffding and Al- 
fred Lehmann. From 1911-14, under 
the direction of Georg Elias Miiller in 
the Institute of Psychology at the Uni- 
versity of Gottingen, he carried out his 
classical investigations of figure and 
ground. The degree of doctor of phi- 
losophy was conferred on him in 1915. 
The next year he returned to the Uni- 
versity of Copenhagen where for two 
years he was dozent and for four years 
lecturer in philosophy. In 1922 he was 
appointed to succeed Lehmann as pro- 
fessor of experimental psychology and 
director of the psychological laboratory. 
Except for two years of exile in Swe- 
den as visiting professor at the Uni- 
versity of Lund, Rubin was actively 
engaged in teaching and research at 
Copenhagen until the time of his death. 
He was president of the Danish Asso- 
ciation for Philosophy and Psychology, 
and also president of the 10th Interna- 
tional Congress for Psychology, which 
he organized in 1932. 

Aside from his two Danish teachers, 
G. E. Miiller exerted the most decisive 
influence on Rubin’s basic thinking, al- 


1 Translated by the editor from the German. 


though like other experimental psychol- 
ogists he was deeply impressed by the 
phenomenological point of view which 
at that time had pervaded the scientific 
atmosphere of Gottingen as a conse- 
quence of the spell cast by the ideas of 
Husserl. This outlook became apparent 
in his chief work, Visuell wakrgenom- 
mene Figuren: Studien in psychologi- 
scher Analyse, which appeared in Dan- 
ish in 1915. The German edition of this 
work must be regarded as one of the 
books which has had a compelling in- 
fluence on the development of the psy- 
chology of perception in recent decades. 
In this volume Rubin gave evidence of 
remarkable versatility and thoroughness 
in dealing with those optic phenomena 
which result from the articulation of the 
visual field into organized figures and 
unorganized background. The findings 
obtained in these studies and the inter- 
pretation which he gave to them brought 
Rubin into close proximity with Gestalt 
psychology. As in this major work, 
so in all later publications Rubin re- 
vealed an almost exclusive interest in 
questions of perception: perceived move- 
ment in visual perception, tactual and 
auditory impressions, and the sensa- 
tions of temperature as well as gusta- 
tory impressions during reception of 
food. Mention should also be made in 
this connection of the exact and very 
penetrating analyses of the temporal 
sense—a field which unfortunately has 
been badly neglected by experimental 
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psychology. In all of these investiga- 
tions Rubin devoted himszIf with out- 
standing success to clear’ng up the re- 
lations between psychology, physics, and 
epistemology, which nowhere else are 
so numerous and close at hand as in 
the domain of perception. 

In psychology Rubin stood for the 
kind of inquiry which, so far as possi- 
ble, is not encumbered with hypotheses, 
for the kind of investigation in which 
the facts are allowed to speak for them- 
selves. In this regard he was somewhat 
opposed to the efforts, which are by 
no means confined to Gestalttheorie, to 
turn psychology in the direction taken 
with such great success by modern phys- 
ics, that of constructing bold hypothe- 
ses. “Psychologists ought not to use 
theories as a cloak for their ignorance,” 
was one of his frequent assertions. He 
urged instead a procedure which he him- 
self called “adspektiv.” By that term 
he meant that psychologists should not 
build up wholes out of elements de- 
rived from analyses of mental phenom- 
ena along the lines proposed in such 
doctrines as “mental chemistry,” “cre- 
ative synthesis,” and “production-the- 
ory.” They should attempt rather to 
get at the data of mind by observir; 
them from those angles that do not dis- 
tort their intrinsic nature. He tried to 
clarify this point of view in the stand 
which he took towards Hoffding’s con- 
cept of synthesis. “Regarding the prob- 
lems with which I was occupied, the 
mental synthesis of Héffding seemed to 
me to be a inakeshift introduced in 
order to build up the wholes which 
had been broken by the analysis which 
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led to sensations. I considered by not 
breaking up the wholes and by keeping 
to them and their aspects in my in- 
vestigations I could save using this 
makeshift.” “Adspektiv” psychology 
has no traffic with elements. It pre- 
fers to concentrate its research upon 
the varied aspects which the total pat- 
tern presents for inspection. 

The collection of papers, selected by 
Rubin himself and published in 1949 in 
Copenhagen under the title Experimenta 
Psychologica, gives an excellent insight 
into his thinking and his methods of 
inquiry. A thorough study of this vol- 
ume will furnish much lively stimula- 
tion and can be warmly recommended 
to the experimental psychologist. The 
sections dealing with “haptische Unter- 
suchungen” are especially worthy of 
note. These studies reveal the work- 
ings of an unusually able and critical 
mind which combined a reve aptitude 
for certain kinds of psychological prob- 
lems together with a passionate desire 
to unravel the problems by every pos- 
sible experimental device. 

Whoever enjoyed the privilege of 
close association with Rubin as friend 
and colleague had occasion to admire 
his extraordinarily quick comprehension 
of all psychological questions, to note 
his grasp of another’s line of thought 
before it was fully expressed, and espe- 
cially to appreciate the critical gift 
which enabled him to take an argument 
apart and point out its flaws in such 
humorous manner that even caustic crit- 
icism never gave offense. 

Davin Katz 

University of Stockholm 





VICARIOUS TRIAL AND ERROR! 


BY J. G. TAYLOR AND B. REICHLIN 
University of Cape Town 


In certain learning situations ani- 
mals display a form of behavior which 
Muenzinger (4) has called ‘vicarious 
trial and error’ or VTE. This is 
assumed to be a feature of behavior 
at any point of choice, but it is most 
readily seen when the response re- 
quired of the animals involves a fairly 
rapid expenditure of energy, as in 
jumping. When a rat is placed on a 
platform from which it can jump in 
either of two directions, the rewarded 
direction in each trial always being 
indicated by, say, a black card, and 
the unrewarded direction by a white 
card; and if the number of the pre- 
ceding trials has not been sufficient to 
ensure that the response will be invari- 


ably correct, then the animal may go 
to the edge of the platform, say at the 
left-hand side, and stretch its head out 
across the gap in the direction of the 


left-hand stimulus. It may even lift 
its front legs and thrust itself farther 
forward, balancing itself on the edge 
of the platform as if it were about to 
jump. But instead of taking off, it 
may then withdraw, move a little to 
the right and repeat the same per- 
formance in the direction of the right- 
hand stimulus. There may be several 
repetitions of this act before the rat 
finally jumps in one direction or the 
other. -In other cases the stretching 
and balancing components of the act 
may be largely eliminated, and all that 
remains is a moving of the head from 
side to side, an orientation of the rat 


1 The research described in this paper was 
made possible by grants from (a) The S. A. 
National Council For Sociz! Research, and 
(b) The University of Cape Town. - We wish 
to record our warm appreciation of this 
assistance. 


in the direction now of one stimulus, 
now of the other. 

Among the most important facts 
about VTE behavior, as recorded in 
the literature, the following may be 
mentioned. 


1. The number of VTE movements 
per trial is low at the beginning of 
learning. After one or two trials their 
number increases, rising to a maxi- 
mum as the proportion of correct 
choices approaches unity, and then 
falls to zero during the final stage of 
the learning. 

2. The maximum number of VTE 
movements per trial is an increasing 
function of the difference, in j.n.d. 
units, between the stimuli which are 
used to mark the right and wrong 
directions. That is, VTE is more fre- 
quent in the case of an easy discrimi- 
nation than in the case of a difficult 
discrimination. 

3. This relationship is reversed in 
the case of human subjects who are 
merely required to discriminate be- 
tween the two stimuli but to take no 
further action relative to the chosen 
one. 

4. VTE movements occur more fre- 
quently when the stimuli to be dis- 
criminated are relatively near to the 
gap that is to be jumped than when 
they are farther away. 

5. There is more VTE when shock 
accompanies the correct response and 
when sheck accompanies the wrong 
response than when there is no shock 
at all. 

To account for such facts Tolman (6) 
has postulated two sets of intervening 
variables which, for expository pur- 
poses, he has incorporated in a tropistic 
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robot cailed the ‘‘Schematic Sowbug.”’ 
The first of these variables controls 
the robot’s orientation towards the 
stimuli, and, failing the intervention 
of the second set of variables, would 
presumably cause VTE indefinitely. 
A stimulus striking obliquely on the 
bug’s receptor-organ creates an “‘ori- 
entation readiness’’ which diminishes 
as soon as the resulting orientation- 
response brings the stimulus fully to 
bear on the receptor. But this has 
the effect that the second stimulus now 
strikes the receptor obliquely, thus 
arousing another orientation-readiness 
and therefore swinging the robot back 
to its original position; then the cycle 
begins again. The difference between 
the two orientation-readinesses when 
the bug is directed towards one of the 
stimuli is conceived to be an increasing 
function of the difference between the 
stimuli, so that when these are nearly 
alike, and therefore hard to discrimi- 
nate, the orientation-readinesses are 
also nearly equal and there is in con- 
sequence very little VTE. This part 
of the theory therefore accounts for 
the higher VTE freque:cy in the case 
of an easy discrimination. 

The second set of intervening vari- 
ables, called “progression-readinesses,”’ 
have certain features in common with 
Hull’s reaction potential (gE) in that 
their magnitudes are determined, al- 
though in an unspecified manner, by 
drive, incentive motivation and the 
consequences of previous responses to 
the two stimuli. But there is one 
important difference. Hull assumes 
that when there are two competing 
reaction potentials the reaction is de- 
termined by that one which happens 
at the moment to be the stronger, and 
the whole of that stronger sEz is active 
in producing the response. Tolman, 
on the contrary, assumes that the 
energy available for driving the animal 
to jump instead of swinging from side 
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to side is proportional to the differ- 
ence between the two progression- 
readinesses. The weaker progression- 
readiness cancels an equal quantity of 
the stronger one. 

The difference between the pro- 
gression-readinesses at any moment 
depends not only on the differential 
consequences of jumping in the right 
and wrong directions but also on the 
momentary orientation of the bug to 
thestimuli. If, after repeated rewards 
for jumping in the right direction and 
punishments for the wrong direction, 
it is facing the right stimulus, the 
difference between the progression- 
readinesses is positive and the creature 
moves forward to get another reward. 
If it is facing the wrong stimulus the 
difference is negative, or positive but 
negligible, so that the creature holds 
back, thereby allowing the reciprocal 
action of the orientation-readinesses 
to swing it into the right direction. 

This feature of the sowbug’s consti- 
tution was specifically introduced in 
order to explain facts 2 and 3 listed 
above, and it must be admitted that 
it achieves this purpose. A human 
subject, required to choose the darker 
of two nearly equal shades of grey, 
experiences no world-shaking conse- 
quences if his choice should be wrong, 
and his progression-readinesses both 
remain at a low level. It follows that 
when he is looking at the right stimu- 
lus the difference between the two 
progression-readinesses is negligible, 
and he is free to keep on swinging. 
But if black and white are substituted 
for the two greys, then when he is 
looking at the black, the difference 
between the progression-readinesses is 
enough to make him “jump” at once. 
On the other hand a rat that has 
bumped its nose against the wrong 
stimulus in a hard discrimination prob- 
lem and been rewarded for jumping at 
the right one acquires substantial pro- 
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gression-readinesses, but the orienta- 
tion-readinesses are nearly equal, so 
that it tends to jump in the direction 
it happens to be facing. So the man 
facing an easy problem and the rat 
facing a hard one both fail to show 
VTE, but for different reasons: the 
man because the forward urge comes 
into action before the orientation- 
readinesses have had time to deflect 
him from his course, the rat because 
there is no force strong enough to 
deflect him from the first direction he 
happens to take up. 

But in accommodating his robot to 
fit facts 2 and 3, Tolman has made it 
impossible for it to act in accordance 
with facts 1 and 5. Fact 4 he fits 
into the scheme by the simple expedi- 
ent of making his bug short-sighted, 
so that an easy discrimination pre- 
sented at a distance is equivalent to a 
hard discrimination at short range. 
This fits the geometric pattern of the 


sowbug very neatly, but that hardly 
justifies the assumption that rats are 


myopic. Indeed Hebb (1) has pre- 
sented evidence that distant visual 
stimuli are more effective than near 
ones in determining the rat’s behavior. 
The ad hoc sensory defect of the sow- 
bug does not add to our understanding 
of the behavior of real organisms. 
With regard to fact 1, the consti- 
tution of the sowbug is such that at 
the beginning of learning the differ- 
ence between the two progression- 
readinesses is negligible in comparison 
with the difference between the two 
orientation-readinesses when the bug 
is directed towards one stimulus. 
There should consequently be more 
VTE than later when the progression- 
readinesses have expanded a bit. In 
fact, Tolman’s theory leads to the 
deduction that the number of VTE 
movements is a decreasing function of 
the number of trials. This deduction 
is not confirmed by experimental data. 
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At first sight fact 5 seems to agree 
with the theory. The effect of ‘both 
rewarding and punishing a correct re- 
sponse should be that the relevant 
progression-readiness increases rather 
slowly, and the difference between the 
two readinesses increases more slowly 
than when the right response is re- 
warded and the wrong one punished 
with shock, or when there is no shock 
at all. Consequently the orientation- 
readinesses have virtually undivided 
control over the organism, and VTE 
is not likely to be interrupted by a 
forward-moving tendency. But for 
the same reason learning should be 
slow. What ultimately prompts the 
animal to go forward is the difference 
between the two progression-readi- 
nesses, and if that remains low by 
reason of the balancing effect of re- 
ward and punishment for the correct 
response, the probability of the wrong 
response being made must remain 
high. But it does not. Learning is 
in fact quite rapid. 

This brings us to another aspect of 
Tolman’s theory. He thinks that 
VTE is an aid to learning. The 
mechanism of this process is appar- 
ently that when the animal is facing 
in the right direction, the stimulus 
“reminds” him, so to speak, of the 
previous satisfactory consequences of 
responding to it, and that has the 
effect of increasing the relevant pro- 
gression-readiness. Similarly, facing 
in the wrong direction diminishes the 
other progression-readiness. Hence 
the more VTE, the more quickly 
the difference between the progres- 
sion-readinesses grows, and the more 
quickly the animal learns. But if 
facing in the right direction reminds 
him of both reward and punishment, 
while facing in the wrong direction 
reminds him only of previous unevent- 
ful but futile excursions, the difference 
between the two progression-readi- 
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nesses is going to grow quite slowly. 
But that is just the reverse of what 
happens. 

The shortcomings of this theory are 
obvious enough. Indeed Tolman’s 
own explanations of behavior of the 
kind the bug was invented to account 
for keep on bursting beyond the 
bounds of the creature’s none too 
tough cuticle. And the chief reason 
for this seems to be that the charac- 
teristics put into the bug were selected 
with the specific purpose of explaining 
VTE. It is theoretically possible to 
construct a robot that will imitate 
almost any selected aspect of beha- 
vior, and there are probably several 
ways of constructing any such robot. 
But our purpose as scientists is not to 
construct any mechanism that will 
perform the desired trick; it is to dis- 
cover, if possible, the mechanism that 
actually does perform the trick in 
living organisms. And that means 
that the mechanism must be con- 
sistent with other behavioral mecha- 
nisms disclosed by research. For this 
reason we propose to present in the 
following pages a hypothesis about 
VTE based on the principles of beha- 
vior as summarized by Hull (2), since 
these principles seer to us to be the 
most consistent and systematic expla- 
nation of the behavior of organisms 
that has yet appeared. 

We begin by postulating that a VTE 
movement is essentially an incom- 
plete or preparatory response which 
we shall symbolize by p. Since p 
commonly involves a smaller expendi- 
ture of energy than the full response, 
R, to which it leads, its threshold may 
be assumed to be lower than that of R. 
That is, sL, < sLe. Hence if sEp is 
insufficient to evoke R, it may be high 
enough to evoke p. And when gEpr 
> slr, p will merge with R, so that 
it cannot be readily detected as a 
separate component of behavior. In 
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situations where there is a choice be- 
tween alternative responses, R, and 
Rz, there will be two preparatory re- 
sponses, p; and p:, and when the rele- 
vant reaction potentials are both below 
the reaction threshold but above the 
threshold for o, that is, when 


sL, < sEr, < slr 
sL, < sEr, < slr 


then the uncorrelated oscillations to 
which sEp, and sEp, are subject have 
the effect that the animal sometimes 
reacts with p;, sometimes with pz, and 
sometimes with neither. He reveals 
VTE. 

We hold that VTE is not a special 
form of behavior arising only in a 
situation involving discrimination, and 
demanding a special behavior mecha- 
nism of its own, but is simply a suc- 
cession of incomplete or preparatory 
responses of the same type as the 
hesitations and false starts that may 
be seen when animals are learning to 
respond to a situation which does not 
call for a choice between alternatives. 
For this reason we propose to test our 
hypothesis in the first instance by 
applying it to a learning process that 
is not complicated by the element of 
choice. 

Our basic assumptions are (1) that 
preparatory movements are mediated 
by the same reaction potential that 
mediates the complete response, and 
(2) that the threshold for preparatory 
movements is lower than that for the 
complete response. It has been shown 
by one of us (5) that the character- 
istics of the distribution of reaction 
velocity (sVe), defined as the recip- 
rocal of reaction latency, can be de- 
duced from the expansion of the bino- 
mial (¢ + 7)*, where 7 is the square of 
the proportion of the oscillation distri- 
bution curve above the reaction thresh- 
old, andi +j7= 1. Hence if we can 


and 


Te ee, 
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find some measure of the preparatory 
responses analogous to s Vr, we should 
be able to predict the way in which 
the distribution of this measure differs 
from that of sVr. 

An obvious way to measure the 
velocity of preparatory responses is to 
multiply the velocity of the complete 
resnonse, sVr, by the number of pre- 
paratory responses, nm’, preceding the 
complete response R. But since we 
assume that p is an integral part of R, 
it follows that the total number of p 
is one more than the number of abor- 
tive responses. We must therefore 
multiply sVre by mn’ +1. This is 
equivalent to taking the reciprocal of 
the mean interval of time from the 
beginning of one preparatory response 
to the beginning of the next. 

For three reasons this is not a very 
satisfactory measure. In the first 
place, the reciprocal of the mean inter- 
val of time between one p and the next 
is not necessarily the same as the 
mean of the reciprocals of the actual 
intervals between successive occur- 
rences of p. But as we have not in 
our experiments measured these inter- 
vals we must perforce be content with 
the compromise. 

The second source of error arises 
from the fact that, especially at the 
beginning of learning, a rat often turns 
its back on the stimulus and engages 
in prolonged exploration of features 
of the apparatus that are irrelevant 
to the act that is to be learned. Dur- 
ing such activity the relevant stimulus 
ceases to act, and the stimulus-trace 
may even vanish. That is, the factor 
which Hull has called stimulus-inten- 
sity dynamism (3) drops to zero. 
Since the duration of this activity is 
included in gtr it follows that the 
mean velocity of p may be consider- 
ably unde: -stimated. 

To deal with this source of error we 
measured not only the reaction la- 
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tency, str, but also the total time, ¢,, 
when the animal was facing in the 
direction of the stimulus. It was this 


latter value that we finally decided to 
use for the purpose of calculating sV,, 
the velocity of the preparatory re- 
sponses. The formula we used was: 


n' +1 


sV, = ty 


where n’ is the number of p move- 
ments preceding the final one which 
merges into R. 

There is a third source of error for 
which we have been unable to find any 
satisfactory corrective. We counted 
as preparatory responses only those 
movements which were obviously di- 
rected towards the stimulus, but it is 
clear that any movement which did 
not get as far as that, and was there- 
fore not recorded, should also be 
counted as a preparatory response. 
No doubt this error is balanced to 
some extent by the use of /, instead 
of str, for the former measure excludes 
the time during which the stimulus- 
trace is active although the stimulus 
itself is not acting directly on the 
receptors. That is to say, both the 
number of recorded responses and the 
measured time during which they 
occur are too small, and the estimate 
of sV, is therefore likely to be not too 
far removed from its true value. 

If we are right in assuming that p 
and R are both mediated by the same 
reaction potential, sEx, and that they 
have different thresholds, it follows 
that the argument presented in the 
article on reaction latency (5) referred 
to above is applicable to the distri- 
bution of sV, as well as to gVp, but 
that these distributions will have dif- 
ferent constants. 

Corollary 1 of the reaction latency 
article states that the mean sVz is 
proportional to the squares of the pro- 
portion of the sOr distribution that 
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stands above sLr. From this, taken 
in conjunction with the postulate that 
sL, < sLe, we deduce that the mean 
sV, is higher than the mean sV x in 
the same trial. For if the threshold 
of p is below that of R, there is a 
larger proportion of the sOe distribu- 
tion above sL, than sLr. This con- 
clusion also follows from our method 
of calculating sV,, so that its empiri- 
cal confirmation is of no significance. 
It is stated here merely for the sake 
of completeness. 

Corollary 4 states that the distri- 
bution of sVe is positively skewed at 
the beginning of learning, symmetrical 
in the middle and negatively skewed 
at the end. This depends upon the 
increase in the proportion of the sOr 
curve that is above the threshold as 
reaction potential increases. When 
sEp is just above the threshold, ¢ is 
very small in comparison with j, so 
that the expansion of (¢ + 7)* yields 
a skew distribution. But at the same 
time there is a substantially larger 
proportion of the sOg curve above 
sL,, and the distribution of sV, is 
therefore less heavily skewed than 
that of sVe. It follows further that 
as sEp increases the distribution of 
sV, becomes symmetrical while that 
of sVz is still positively skewed, and 
is negatively skewed when the distri- 
bution of sVeg is symmetrical. In 
short, the distribution’ of sV, goes 
through the same transformation as 
that of sVz but starts at a later stage. 

Corollary 7 states that ‘‘as learning 
progresses the standard deviation of 
sVe gradually increases, reaching a 
maximum when the distribution is 
symmetrical, and then gradually di- 
minishes.’’ This follows from the fact 
that the standard deviation of the 
binomial distribution is Vkij. Since 
at the beginning of learning the dis- 
tribution of sV, is more nearly sym- 
metrical than that of sVe, it follows 
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that its standard deviation is nearer 
its maximum value. It reaches its 
maximum earlier and is already de- 
clining while that of sVz is approach- 
ing its maximum. 

Corollary 8 states that the coeffi- 
cient of variability of sVer is a de- 
creasing monotonic function of the 
number of reinforcements. This de- 
duction obviously applies equally to 
gV,, but we may deduce further that 
since the initial value of the mean sV, 
is higher than that of s Vp, the initial 
value of the coefficient of variability 
is lower, and it remains lower through- 
out the course of learning. 

Corollary 9a states that ‘‘the square 
root of the ratio of the mean s Vz to the 
maximum sVp( VV) is an increasing 
ogival function of the number of rein- 
forcements (), the exact form of this 
function depending on the relation- 
ship between the standard deviation 
of oscillation (¢)) and reaction poten- 
tial (sEr).” This deduction applies 
to sV, as well as sVr, but since the 
initial value of the mean gsV, is a 
higher proportion of its final (maxi- 
mum) value than the initial value of 
the mean sVz is of its maximum, it 
follows that the initial level of the 
ogive is higher in the case of sV,. It 
is, in fact, a truncated ogive. 

In addition to the above deductions 
there are others which are not directly 
dependent on those stated in the reac- 
tion latency article. These concern 
the number of preparatory responses 
that may be expected at different 
stages in the learning process. A dia- 
gram will help to make the argument 
clear. Figure 1 shows the two reac- 
tion thresholds, sZ, and sLz and the 
sOr curve cutting across these at 
various stages in the learning process. 
These curves are shown as positively 
skewed at the beginning and nega- 
tively skewed at the end of learning. 
There is as yet no experimental evi- 
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dence to justify this, but even if it 
should prove wrong, the argument 
that follows is not affected. The 
curves are also shown as first increas- 
ing in range and then decreasing. 
There is some evidence in support of 
this (5), but again the general tend- 
ency of our argument is not affected. 

Consider now the first sOg curve. 
A very small proportion of its area is 
above sLr, and consequently gs Vx will 
be very small. The area above gL, is 
larger, but it is less than half the total 
area, so that sV, will also be small. 
That is to say, there will be long inter- 
vals between preparatory responses. 
Passing to the second curve we find a 
slight increase in the area above sLr 
and a very substantial increase in the 
area above sL,. Hence the mean 
sVe has increased slightly, but the 
mean gV, has increased to a much 
greater extent. This means that the 
frequency of preparatory movements 
per unit of time has increased much 
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more rapidly than the frequency of 
the complete response. 

As we pas* from stage 2 to stage 3 
the proportion of the area above sLz 
increases with positive acceleration 
whereas the increase in the area above 
sL,*is negatively accelerated. This 
means that although the frequency of 
preparatory responses per unit of time 
continues to increase, the absolute 
number of such responses is likely 
to decrease by reason of the shorter 
length of time that elapses before the 
process is terminated by the complete 
response. 

By the time we reach stage 4 the 
entire sQOr curve is above sL,, and 
hence preparatory responses occur 
with maximum velocity. But now 
more than half the curve is above 
sLr, and that means that the time 
available for preparatory responses is 
severely curtailed, so that their abso- 
lute number drops steeply. At the 
last stage, when most of the sOz curve 
is above slp, the latency of R has 
nearly reached its minimum value, 
and hence p nearly always passes with- 
out a break into R. That is to say, 
the absolute number of preparatory 
responses has fallen practically to zero. 

To make these deductions more pre- 
cise we may assign numerical values 
to the proportions of the sOg curve 
above the two thresholds and calcu- 
late the number of preparatory move- 
ments that may be expected at each 
stage. 

Table 1 shows, in the first two rows, 
the proportion of the sOz curve above 


TABLE 1 








Stage 





Proportion of sOg above sLz 
above sl, 
Square of proportion above sLz 
above sl, 


Number of preparatory movements 100 





i 
02 
.20 
004 
04 
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the two thresholds. In the next two 
rows these proportions are squared. 
Since the velocities of R and p are 
proportional to the values in the 3rd 
and 4th rows, respectively, the abso- 
lute number of preparatory move- 
ments is given by dividing the entry 
in the 4th row by the corresponding 
entry in the 3rd row. This of course 
includes the final p which merges 
into R. 

These deductions may be summed 
up in the following two statements. 
(A) The frequency of preparatory re- 
sponses per unit of time is an increasing 
function of the number of reinforce- 
ments, rising at first with positive and 
then with negative acceleration. (B) 


The absolute number of preparatory 
responses per trial is a decreasing func- 
tion of the number of reinforcements. 

It should be noted however that in 
practice the absolute number of pre- 
paratory responses in the early stage 
of learning is likely to be substantially 


less than theory would lead one to 
expect. When reaction potential is 
low, many such responses will be 
initiated, but will be terminated be- 
fore their true character becomes obvi- 
ous to the experimenter. The theory 
makes no distinction between re- 
sponses of this kind and those that 
are obviously directed towards the 
goal, but in practice it is only the 
latter that are recorded. As reaction 
potential increases, the proportion of 
preparatory responses that get ‘‘nipped 
in the bud” is likely to get much 
smaller, so that few of them will pass 
unnoticed. 

To test our hypothesis we have data 
from the experiment described in the 
latency article. In this experiment 
118 rats were taught to jump onto the 
entrance platform of a rectangular 
maze with a food box at each side. 
Both food boxes were open, and each 
contained 2 grams of food. The drive 


J. G. Taytor AND B. REICHLIN 


was 23 hours of hunger. For each 
rat preliminary training was continued 
until it reached a criterion of two suc- 
cessive unprompted jumps over a 6- 
inch gap, and on the following day the 
gap was widened to 8 inches. There 
were no other changes. The animals 
were classified into four groups on the 
basis of the number of days of pre- 
liminary training they required, and 
subsequent training was continued 
until all the animals in a group had 
reduced their jumping lztencies to .7 
seconds or less. Later the animals 
were reclassified on the basis of the 
number of days they required to re- 
duce their latencies to .7 seconds or 
less, and it is this revised classification 
that we shall use here. 

In addition to the reaction latencies, 
the experimenter recorded, by means 
of a stop-watch with a separate device 
for returning the hands to zero, the 
total time (t,) during which the animal 
appeared to be reacting to the maze 
by preparatory movements, and also 
the number (n’) of distinct prepara- 
tory movements. For this purpose 
actions of the following types were 
considered to be preparatory move- 
ments: (1) a move across the front of 
the platform from left to right or from 
right to left; (2) a movement of the 
head or of the head, shoulders and 
front legs, towards the right or left 
half of the maze entrance without 
bodily movement across the platform; 
and (3) a direct walk from the back of 
the platform towards the front. 

It will be seen that some of these 
actions are very similar to typical VTE 
movements, although it is obvious 
that they cannot be explained by Tol- 
man’s theory, since no discrimination 
is called for. There is no difference 
in difficulty between jumps to the 
right, left and center of the entrance 
platform; and after the jump the ani- 
mal is still free to choose between the 
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right and left alleys, so that even if 
discrimination is called for it comes 
into operation only after the animal 
has jumped. The movements we have 
recorded are therefore preparatory to 
the jump and not necessarily to the 
choice between alleys, and for that 
reason a simple movement from back 
to front of the platform, which would 
not be regarded as VTE, is legitimately 
included as a preparatory movement. 

Table 2 presents the mean, standard 
deviation and coefficient of variability 
of sV, at each trial for each of our 
four experimental groups. It also pre- 
sents, in the 4th column of each sec- 
tion, ¥V,, which is defined as the root 
of the ratio of the mean gsV, to the 
maximum. As in the case of sVrp, 
the maximum velocity is taken to be 
2, so that V, = an This has the 
effect that the beginning of the final p 
which merges into R is taken to be 
contemporaneous with the beginning 
of R. At the early trials, when the 
latency of % is substantial, this identi- 
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fication of the beginning of the final p 
with the occurrence of R has little 
effect on the estimate of sV,; but at 
the later trials, when there is only 
one preparatory movement, and that 
passes without « break into R, it is 
obvious that the latency of p is over- 
estimated. We had at first thought 
of meeting this difficulty by adding 
2 instead of 1 to the number of re- 
corded preparatory movements, but 
this proved to be excessive in that it 
gave rise to certain absurdities that 
came to light when the theory was 
applied to selective learning, and ac- 
cordingly we decided to add 1 only. 
This means that the maximum sV, 
is taken to be too small, with the result 
that the early values of VV, are over- 
estimated. (On the other hand, the 
addition of 2 to m’, which implies that 
the maximum vclocity is 4, probably 
underestimates .V,.) At the later 
tria!s the number of animals for which 
n’ = 0 increases, and since for each of 
these animals sV, is underestimated, 
the mean becomes progressively dis- 
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TABLE 3 
CONSTANTS OF THE sVg DISTRIBUTIONS 
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torted. This is clearly seen in group 
C, where the mean drops from .940 
at the 7th trial to .853 at the 8th 
trial. The mean mn’ +1 at the 7th 
trial was 1.3, and at the next trial it 
fell to 1.0. That is to say, about 30 
per cent of the animals for which the 
estimated sV, was reasonably correct 
at the 7th trial were added at the 8th 
trial to the group for which sV, was 
underestimated. This sudden increase 
in the proportion of underestimated 
values is reflected in the means. There 
is a similar phenomenon in group D. 
In groups A and B there is no inver- 
sion of the sV, curve, but there ap- 
pears to be an inflection in group B at 
the point where n’ becomes zero. 

Since our deductions involve com- 
parisons between sVz and sV,, the 
relevant data from the reaction latency 
article are reproduced in Table 3. 

A comparison of the A columns in 
Tables 2 and 3 shows that our first 
deduction is confirmed, but this in 
itself, as we have already stated, is of 
no scientific significance. 

Before considering the remaining 


deductions it is necessary to refer to 
a discrepancy between theory and 
observation which escaped notice in 
the reaction latency article. In that 
article it was deduced that the stand- 
ard deviation of sVz would rise to a 
maximum when the distribution was 
symmetrical and then decline. Ex- 
amination of Table 3 shows that the 
standard deviation reaches its maxi- 
mum quite near the end of learning, 
whereas symmetry of the distributions 
is reached about the middle. The 
standard deviation continues to in- 
crease while the form of the distribu- 
tion is changing in the direction of 
negative skewing. 

A possible explanation of this dis- 
crepancy is suggested by Hull’s mime- 
ographed Memorandum of May 16, 
1950, in which it is deduced that ‘The 
distribution of reaction latencies, con- 
ditioned, ¢.g., to the subsident phase 
of the stimulus trace, will have its 
median in the recruitment phase, 
skewing off with a relatively long tail 
in the subsident phase of the trace.” 
This implies that there is a source of 
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variability in reaction latency, and 
hence in velocity, over and above that 
discussed in the latency article. There 
it was assumed, although not explic- 
itly stated, that the stimulus acts 
continuously and uniformly until its 
effect is terminated by the response. 
The strength of the stimulus-trace will 
of course vary a great deal while the 
animal is on the jumping platform, 
especially in the early stages of learn- 
ing, when it frequently turns its back 
on the stimulus. But each prepara- 
tory movement involves a new pres- 
entation of the stimulus, so that the 
trace is continuously being renewed 
and held at a relatively high average 
strength. As latency diminishes, and 
with it the number of preparatory 
movements, it becomes increasingly 
probable that the animal's behavior is 
determined by a single snap-shot pres- 
entation of the stimulus. In that case 
the stimulus-trace can no longer be 
regarded as constant, but must be 
assumed to increase rapidly and di- 
minish slowly, so that its value at 
the moment of response-evocation be- 
comes an additional source of varia- 
bility in the velocity both of the 
response itself and of the preparatory 
movements. 

At the beginning of learning, when 
sEz is low, the reaction is likely to be 
evoked only when the stimulus-trace 
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is at or near its maximum value. 
With increase in sEp progressively 
lower values of the trace become ade- 
quate to evoke the response, so that 
variability arising from the course 
of the stimulus-trace becomes an in- 
creasingly important factor. Hence, 
towards the end of learning this addi- 
tional source of variability is likely to 
delay the diminution of the standard 
deviation predicted from the binomial 
theorem. 

Since we have assumed that the 
threshold for preparatory movements, 
sL,, is lower than sLp, it follows that 
low values of the stimulus-trace may 


‘be effective in evoking prepaiatory 


movements at a time when high values 
are still required for the evocation of 
a jump. Hence the additional source 
of variability due to the course of the 
stimulus-trace becomes effective in re- 
lation to preparatory movements be- 
fore it begins to affect the complete 
response. 

In order to see how variability aris- 
ing from the stimulus-trace affects the 
problem, we may consider a skew fre- 
quency distribution of latencies such 
as Hull has deduced. Figure 2A rep- 
resents such a curve plotted against a 
scale of latencies. The scale values 
have been chosen in such a way as to 
yield a range of velocities such as we 
find near the end of learning. In Fig. 
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2B the scale of latencies has been con- 
verted into a linear scale of velocities 
and reversed in direction. The ordi- 
nates of curve A have been plotted 
against the corresponding velocities to 
produce curve B. It will be seen that 
the velocity curve is slightly skewed 
in the positive direction, and this effect 
would be more marked if the minimum 
latency were taken to be .2 instead of 
.3. Latencies of .2 are in fact not 
infrequent in the case of preparatory 
movements, and if all such movements 
could be detected, the minimum la- 
tency might prove to be even smaller, 
with the result that the distribution 
of velocities would be more heavily 
skewed in the positive direction. 

The two sources of variability, de- 
duced from the stimulus-trace and 
from the binomial theorem, thus result 
in distributions that are skewed in 
opposite directions, and the empirical 
distributions of sV, will therefore be 
more nearly symmetrical than the 
binomial theorem would lead us to 
expect. 

This deduction is confirmed by our 
empirical data. As a typical example 
we present in Table 4 the distributions 
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of sVz and sV, for che 4th to the 7th 
days of group C. The scales are dif- 
ferent in each case, but these are not 
presented, since we are here concerned 
only with the shape of the distribu- 
tions. On the 4th day the distribution 
of sVe is positively skewed, but that 
of sV, is more nearly symmetrical, in 
accordance with the deduction from 
the binomial theorem. On the follow- 
ing days the distribution of sVe be- 
comes more symmetrical, but there is 
only a slight tendency to change in 
the direction of negative skewing in 
the sV, distributions. Inspection of 
the mean velocities, in the last row of 
the table, shows that the latencies of 
R are greater throughout than those 
of p. On the 5th day, for example, 
the latency corresponding to the mean 
sVe is 4.6 sec., while that of sV, is 
1.3. Variation in the stimulus-trace 
is therefore more likely to be effective 
in modifying the dispersion of sV, 
than of sVr. That is to say, the 
change in the direction of negative 
skewing is likely to be masked. 
Tables 2 and 3 show that the stand- 
ard deviation of sV, apparently reaches 
its maximum at about the same time 














TABLE 4 
4th day Sth day 6th day 7th day 
sVeR 8Vp sVe BV, sVeR Vy, sVe 8Vp 
1 5 6 2 11 1 3 2 
18 6 9 4 7 3 4 3 
10 4 6 4 7 2 4 8 
5 9 5 2 3 6 6 6 
2 6 4 10 5 4 4 1 
1 5 4 10 1 12 2 13 
1 2 3 3 2 4 6 2 
0 0 0 2 1 0 3 0 
0 1 1 1 2 3 5 1 
1 1 0 1 3 0 2 
0 1 2 1 
0 
1 
Means .129 .378 .216 .753 381 .930 .593 .940 
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as thatof sVr. This is not in accord- 
ance with the deduction from the 
binomial theorem, but there are two 
factors that probably distort the re- 
sults. In the first place, when the 
number of recorded preparatory re- 
sponses falls to zero, sV, is assumed 
to be identical with Vp, and is there- 
fore underestimated. It is not im- 
possible that if the true values of sV, 
could be determined, their dispersion 
would turn out to be more skewed 
than that of sVz and the standard 
deviation would be lower. 

The second distorting factor is the 
variability arising from the course of 
the stimulus-trace. Since we have 
assumed that this variability becomes 
increasingly important as the super- 
threshold value of sEe increases, it 
follows that at any stage this factor 
plays a relatively greater part in de- 
termining the dispersion of sV, than 
of sVr. Hence at a time when the 


dispersion of sVz is symmetrical and 
is still mainly determined by the bi- 
nomial expansion, the variability of 
sV, is being largely influenced by the 


stimulus-trace. As this factor tends 
to produce a tail in the positive direc- 
tion, while the binomial distribution 
has a tail in the negative direction, 
the standard deviation must be greater 
than the binomial theory predicts. 
We submit therefore that there is no 
essential conflict between theory and 
observation. 

Tables 2 and 3 show that our next 
deduction, concerning the coefficient 
of variability, is confirmed, although 
here again the distorting factors re- 
ferred to above are in evidence. For 
each group the coefficient of varia- 
bility of sV, begins at a lower level 
than that of sVz, and diminishes from 
trial to trial. In groups C and D 
there is an inversion at the point where 
the number of recorded preparatory 
responses vanishes, but this is fully 








Trial 





ia 
AnPwne OCI AUMER WN 


SES85052 

















explained by the errors due to con- 
founding sV, with sVr. 

The 4th column for each group in 
Tables 2 and 3 shows that our deduc- 
tion concerning the proportion of the 
sOr curve above the threshold is con- 
firmed. The proportion above gL, is 
in each case greater than the propor- 
tion above slp at the beginning of 
learning, and rises to the same level at 
the end. The curve is thus a trun- 
cated ogive, as we have deduced. It 
is of some interest to note that the 
initial level of this curve is a decreasing 
function of the number of reinforce- 
ments required to reach the criterion 
of learning. This may be readily ex- 
plained by reference to the fact that 
two unprompted jumps were required 
before the main experiment began. 
The rapid learners would obviously 
acquire a larger increment of sEz from 
these two reinforcements than the 
slower learners. 

Our final deduction is that the abso- 
lute number of preparatory responses 
per trial is a decreasing function of 
the number of reinforcements. Table 
5 shows, for each group, the average 
number of such responses at each trial, 
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including the movement which merges 
into the jump. 

The number of preparatory move- 
ments shown here is very much smaller 
than the theoretical numbers shown 
in Table 1, but this is to be expected 
in view of the difficulty of detecting 
any but overt responses. In spite of 
that, the number of movements de- 
creases from trial to trial, as we pre- 
dicted. 

It will be observed that the number 
of preparatory movements in the first 
trial increases from group A to group 
D. This too is in agreement with 
theory. Although our deduction was 
stated as-a relationship between the 
number of movements and the number 
of reinforcements, it is obvious that 
this relationship depends on (a) the 
relation between sEz and the number 
of reinforcements, and (b) the relation 
between the proportions of the sOr 
curve above the two thresholds and 
sEr. The number of preparatory 
movements is directly determined by 
the latter relationship. As we noted” 
above, the level of sEr at the first 
trial is highest for group A and lowest 
for group D. Hence the increasing 
number of movements from A to D 
represents a further empirical confir- 
mation of our hypothesis. 
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The present discussion deals with be- 
havior in a situation which involves no 
choice between alternative responses, 
and therefore gives no opportunity 
for VTE as defined by Muenzinger. 
We believe that VTE is essentially a 
succession of preparatory movements 
directed towards alternative goals, and 
in that case it should be amenable to 
treatment by methods analogous to 
those we have applied to such move- 
ments in a situation where there is 
only one goal. A paper on this sub- 
ject is in course of preparation. 
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WHAT IS A FORM?* 


BY JAMES J. GIBSON 
Cornell University 


The term form is used by different 
people to mean different things and by 
the same person to mean different things 
on different occasions. It can refer to 
the curved surfaces of a human female 
or the contours of a crankshaft, to a 
polyhedron or the style of a man’s ten- 
nis game. Shape, figure, structure, pat- 
tern, order, arrangement, configuration, 
plan, outline, contour are similar terms 
without distinct meanings. This indefi- 
nite terminology is a source of confu- 
sion and obscurity for philosophers, art- 
ists, critics, and writers. It is an even 


more serious difficulty for scientists and 
psychologists. Ambiguity is excusable 
in the preliminary exploration and dis- 
cussion of a problem, but it cannot be 
tolerated when a theory has reached the 
stage of experimental verification. A 


more rigorous terminology is very much 
needed. The psychological problem of 
how animals and men perceive form re- 
quires a definition of what it is that is 
perceived. Experiments in the field of 
form-perception and constancy of shape 
can only be decisive if one experimenter 
knows what the other is talking about. 

When the environment of an individ- 
ual is said to consist of objects, places, 
and events, a rough threefold classifica- 
tion of “formal” properties is suggested. 
One ordinarily applies the term form to 
an object, arrangement to a place, and 
order to an event. We might agree that 
the perception of a single object (one 
delimited by a surface), the perception 
of a set of objects (a region of space), 

1 This inquiry is connected with a series of 
experiments on the visual perception of the 
environment performed under Contract AF41 
(128)-42 between Cornell University and 
the USAF School of Aviation Medicine. 


and the perception of what happens to 
objects (movement and sequences in 
time) are three distinguishable prob- 
lems each of which deserves a terminol- 
ogy of its own. The terminology with 
which this paper is concerned is the 
first. The only kinds of visual form we 
shall undertake to deal with here are 
those associated with or derived from 
physical objects. Perhaps the simplest 
kind of shape is that embodied in an 
isolated object. If so, the effort at 
definition should begin with this kind." 

Even in this limited sense there seem 
to be at least three general meanings for 
the term form. There is first of all the 
substantial shape of an object in three 
dimensions. Second, there is the pro- 
jection of such an object on a flat sur- 
face, either by light from the object or 
by the human act of drawing or the 
operation of geometrical construction. 
Images, pictures, drawings, and outlines 
are examples of form in this second 
sense. Third, there is the abstract geo- 
metrical form composed of imaginary 
lines, planes, or families of them. In 
this last sense of the term, form is 


1 There are, of course, many other formal 
properties of the environment even beyond 
those listed, such as the expression of a face, 
the composition of a painting, and the struc- 
ture of a family. All these are unquestionably 
perceived, but they are so complex that the 
task of the psychologist is probably one of 
exploratory experiment rather than attempting 
to make exact definitions. Some psychologists 
in the tradition of phenomenology have tried 
to describe these higher-order varieties of 
form. The overall quality of a picture or a 
poem which makes it “good” would not be 
made any more intelligible, however, by im- 
posing a terminology on critics of art and 
literature. The writer is only proposing that 
we be precise at a simple level where it may 
be profitable. 
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said to be farthest removed from “sub- 
stance.” The reservation should be 
noted, however, that geometrical forms 
are presented to students as perfectly 
substantial black marks on white paper. 
If one simply puts the question, How 
do we perceive form? without distin- 
guishing between these three meanings 
of the term, a clear answer can hardly 
be given. Nevertheless, the problem of 
form-pergeption in psychology has usu- 
ally been put in this way. What are 
the main solutions to the problem of 
form-perception? One kind of answer 
is to suppose that man simply has a 
form-sense. This is in effect no answer 
at all. It is still accepted, however, 
especially among ophthalmologists and 
physiologists. An alternative kind of 
ansr-er is to assert that man. somehow 
learns to perceive form. This answer 
is also widely accepted among some 
psychologists, although no one has been 
able to show just how this learning 
might occur. A third and more sophis- 
ticated answer is that of the Gestalt 
theorists who propose the general for- 
mula that the excitations on the retina 
are converted into forms by a process 
of organization in the brain, and who 
look to the general “laws of form” for 
an explanation of form-perception.’ All 
these answers seem to take it for granted 
that a form is simply a form and that 
since everyone knows what the term 
means there is no need to specify it. 
Whether the form referred to is a physi- 
cal thing or is an abstract property of a 


2 The original laws of visual ,organization 
were formulated by Wertheimer (8) on the 
basis of observations with spots, lines and out- 
lines, and their implications were developed by 
Koffka (4). The effort of Kéhler to explain 
form-perception. (5, 6) appears to be a search 
for laws of electrical current flow in the brain 
rather than laws of perceived form. Kéhler 
believes, however, that these will ultimately 
prove to be the same, on the basis of his 
hypothesis of “isomorphism” between cortical 
process and phenomenal percept. 
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physical thing is not clear. Experiment- 
ers sometimes refer to a form, which im- 
plies that it is concrete, and at other 
times to the form of a thing, which im- 
plies that it is abstract. The only way 
to discover what they mean is to exam- 
ine what they put in front of the ob- 
server’s eyes during the experiments. 
In the vast majority of studies of form- 
perception it is artificial deposits of one 
or another sort on a paper surface. The 
theories of visual form, on the whole, 
have been based on evidence obtained 
with outline drawings. 

It will be argued here that drawings 
are particularly inappropriate objects 
with which to begin a study of the per- 
ception of form. A drawing is a human 
production never found in a natural en- 
vironment; it is complicated by being a 
thing with which men communicate with 
one another; it is not a simple presenta- 
tion to sense-organs but a representation 
or a substitute-object. As a stimulus 
for perception it is convenient, but it 
is far from being the primary or fun- 
damental stimulus which psychologists 
have usually taken it to be. 

The reasons for supposing that the 
primary kind of form is a drawn form 
would make a long chapter in the his- 
tory of scientific thought. A main rea- 
son, however, is the classical assumption 
that two-dimensional vision is immedi- 
ate, primitive or sensory, while three- 
dimensional vision is secondary, derived, 
or perceptual. One must first see a 
plane form before one can see a solid 
form. This notion is connected with 
the argument that the three-dimensional 
properties of things can have no corre- 
lates in a two-dimensional retinal image, 
and that the three-dimensional proper- 
ties must therefore be reconstructed by 
the mind or the brain. The writer has 
suggested elsewhere that this argument 
is a fallacy (1). So far from plane vi- 
sion being primary and solid vision sec- 
ondary, it is the other way around: 
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There is overwhelming evidence to show 
that solid vision is primary and that 
plane vision is acquired only with train- 
ing and by adopting a special attitude. 
The impression of a visual world may 
well prove to have a straightforward ex- 
planation; the impression of a visual 
field, however, is a very sophisticated 
kind of seeing and its explanation is 
far from being simple. 

It is possible to revise the traditional 
view that our visual sensations are two- 
dimensional and our perceptions are 
three-dimensional. ' When the doctrine 
is thus turned upside down it may ap- 
pear strange on first inspection, but its 
intelligibility is much improved. The 
existing theories of form-perception aim 
first at two-dimensional form. If they 
fail to be convincing, the reason may 
be that three-dimensional impressions 
which have the property of shape are 
actually easier to account for than two- 
dimensional impressions which have the 
property of shape. Perhaps the theories 
have been off on a false scent. 

After a criticism as sweeping as the 
abeve, it is only fair to invite some re- 
turn. A set of definitions and distinc- 
tions will therefore be proposed for 
the main types of visual form. If 
they are unacceptable, at least they 
will be clearly so. What are the various 
intelligible meanings that can be as- 
signed to the term? 

la. Solid form. The closed physical 
surface enveloping a substance of some 
kind; the margin between two states of 
matter (usually between a solid and 
air). The surface may be curved or it 
may be composed of adjoining flat sur- 
faces with edges; the former type can 
always be treated mathematically as a 
special case of the latter when the num- 
ber of flat segments is very large. Or- 
ganisms tend to have curved surfaces 
and to change their form with growth 
(7); fabricated objects and a few natu- 
ral objects like crystals tend to have ad- 
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joining flat faces and to resist trans- 
formation. 

Objects have solid forms in the sense 
defined. When we perceive a detached 
object we also see a solid form—the 
depth, relief, or modelling of the sur- 
face. How we do so is a problem of 
long standing. 

lb. Surface form. A flat physical 
surface with its edges; the face of an 
object (or one of the faces of a thin 
sheet of material such as paper). A 
surface-form always has an orientation 
which we shall term slant. Slant can 
be defined as the angle of inclination 
of the surface to the line of sight or, if 
preferred, to the axis of gravity (2). 

Perceiving a surface-form involves 
perceiving both the slant of the surface 
and the form of its edges; an impres- 
sion of form is never obtained without 
some accompanying impression of the 
angle at which the surface lies, either 
frontal or inclined. The problem of 
shape-constancy, so-called, is better for- 
mulated as the problem of seeing shape- 
at-a-slant. 

If the modelling of a solid form is re- 
ducible to the varying slants of its faces, 
a solution for the problem of how we see 
slant ought to provide a solution for the 
problem of depth and relief. The slant 
of a surface is a physical variable which 
is simpler to define and easier to manip- 
ulate than is the modelling of an object. 

2a. Outline-form. Physical tracings 
made with ink, pencil, or paint on a 
surface, which geometrically represent 
the edges of a surface-form or the mar- 
gins of a solid form. ‘These tracings 
have a finite thickness; they are drawn 
lines rather than the theoretical lines of 
geometry. They have two margins in- 
stead of the one margin exhibited by 
the edge of an object. 

The perception normally aroused by 
an outline-form is quite unlike the out- 
line itself. The paper surface is scarcely 
seen and a different surface seems to 
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emerge within the outline. The paper 
surface appears to become “background” 
and to recede while the inclosed surface 
seems to take on “figural” qualities and 
to stand out (4). This, however, is a 
sophisticated report. Most observers 
perceive an cbject and do not see trac- 
ings on a surface at all. When you press 
the question, however, they tell you that 
they do not literally see a physical ob- 
ject but a picture of it. Hence the per- 
ception is not like that of a solid form.* 

2b. Pictorial form. Any representa- 
tion of a physical object on a surface 
by drawing, rendering, painting, pho- 
tography or other means. This would 
include outlines, silhouettes, plan-views, 
engineering drawings, and perspective 
drawings; it extends to transparencies, 
images projected on a screen, motion 
pictures and in short, to the vast variety 
of things we call pictures. One feature 
is generally to be found—a frame, usu- 
ally rectangular, edging the surface on 
which the representation appears. A 
pictorial form is normally presented to 
the eyes with the surface perpendicular 
to the line of sight, that is, at a zero 
slant. 

Some pictures represent other physi- 
cal properties of an object in addition 
to the margins and surfaces of the ob- 
ject, such as color, texture, shading, and 
motion. A color photograph or a 16th 
century Dutch still-life are examples. 
If a sufficient number of variables has 
been incorporated in the deposits of pig- 
ment or dye on the surface, an instruc- 
tive result may be achieved by an in- 


8A corollary of this definition is that the 
figure-ground phenomenon has been derived 
fro 1 the perception of outline-forms, not from 
a study of all forms or of all perception. The 
universality of the phenomenon as ordinarily 
described is therefore questionable. It is one 
of the most convincing tenets of Gestalt 
theory in its battle with elementarism but 
whether it will serve as the fundamental basis 
for a complete theory of perception is not so 
certain. 
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genious experimenter. He may fool an 
observer into believing he sees a real 
object instead of a picture. When it 
is carefully arranged that the picture is 
seen through an aperture so that the 
frame is invisible, the head is motion- 
less, and only one eye is used, the re- 
sulting perception may lose its repre- 
sentational character. This may be 
termed a “peephole situation.” In these 
circumstances, a pictorial form is equiv- 
alent to a solid form or, as we say, the 
observer has the illusion of reality. It 
may be noted that his retinal image 
closely resembles the one he would 
have in an actual peephole situation 
with a solid form.* What this dem- 
onstration brings out is the fact, often 
forgotten, that a pictorial form as ordi- 
narily viewed induces a quite different 
type of visual perception from that of 
a solid form. 

Any pictorial form, including the spe- 
cial case of a simple outline form, has 
been defined as a representation of an 
object. We therefore need a definition 
of a representation. How is one made? 
The fundamental! types are (a) the plan 
and (b) the perspective of a surface- 
form. . 

2c. Plan-form. Outlines indicating 
the plan-projection or “plan-view” of 
the edges of a surface form. A plan- 
view is exemplified by an engineering 
drawing. It does not involve a trans- 
formation. The terms projection and 
transformation will be defined later. 

2d. Perspective-form. Outlines indi- 
cating a perspective-projection or “per- 
spective view” of the edges of a surface 
form. This always involves a trans- 
formation (relative compression or fore- 
shortening). We say that such a draw- 
ing shows the object in perspective. 


* Other techniques of pictorial viewing which 
aim at the complete illusion of reality, no- 
tably stereoscopic movies, are impressive but 
fail to achieve this end. As long as the frame 
is visible, a picture will look like a picture. 
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It is true that, after training, adults 
can visualize the perspective-form of an 
object they see without having a draw- 
ing or picture in front of them. This 
training is what enables artists to make 
perspective drawings without using spe- 
cial optical or geometrical techniques. 
Children, in general, cannot do so. This 
ability to visualize a thing on a picture- 
plane is probably what lends plausibil- 
ity to the unfortunate doctrine that we 
have sensations of form mediated by a 
“form-sense.” ‘The assumption is that 
the retinal image of a three-dimensional 
object is a perspective picture in two 
dimensions and that hence the resulting 
sensation must be a perspective-form in 
two dimensions. The doctrine is then 
faced with the knotty problem of how 
the sensation can be converted into a 
three-dimensional perception. 

2e. Nonsense-form. Tracings on a 
surface (a pictorial form) which do not 
specifically represent (are not a projec- 
tion of) a recognizable object. A draw- 


ing may be meaningless because (a) the 
projection is crude or inaccurate or (b) 
the tracings are accidental, like a child’s 
scribble or the contours of an ink-blot, 
or (c) the tracings have a plan or sys- 
tem not designed to be a projection of 


a recognizable object. These latter are 
what modern artists often construct and 
call abstract forms, but they should be 
distinguished from the abstract geomet- 
rical forms to be defined later. They 
are also called non-objective forms or 
non-representational forms, and these 
terms are better. It should be noted 
that a drawing may also be meaningless 
for a quite different reason, because, 
although it is an accurate projection 
of an object, the object is not recog- 
nizable to the observer. Biological 
drawings and mathematical construc- 
tions are often of such a nature. 

The fact is that nonsense-forms are 
never nonsensical; they are never ac- 
tually meaningless to an observer, but 
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are simply unspecific or ambiguous. 
The perceiver discovers a succession of 
objects in the picture or, if not objects, 
then surfaces, edges, and fanciful con- 
structions which are often aesthetically 
interesting. 

We come next to the genuinely ab- 
stract forms of geometry. They are 
certainly not substantial surfaces and 
edges although they are just as cer- 
tainly connected with these things. 
They are also not tracings on paper, 
although they are represented or sym- 
bolized by such. They are in a class 
by themselves. 

3a. Plane geometrical for:r. An im- 
aginary closed line on an imaginary 
plane. A geometrical line has no width 
and a geometrical plane has no thick- 
ness. A geometrical line is indicated 
or suggested by a substantial tracing 
on a substantial surface, but the two 
should not be confused. Geometrical 
lines and planes can be specified by 
the equations of analytic geometry more 
accurately than they can be drawn on 
paper. For the practised mathemati- 
cian equations are often preferable to 
drawings. Geometrical forms are infi- 
nitely variable and only a very few spe- 
cial cases of them have names. Words 
like triangle, rectangle, square, circle 
stand for only the most familiar geo- 
metrical forms. 

3b. “Solid” geometrical form. An 
imaginary closed surface in an imag- 
inary space of three dimensions. The 
forms of solid geometry are, of course, 
no more solid than the blue sky. They 
are the prototype of all ghosts. They 
are, in fact, the ghosts of objects just 
as planes are the ghosts of surfaces, 
lines the ghosts of edges, and points the 
ghosts of particles. We can conceive a 
geometrical form but we cannot see it 
in the same sense that we see an object, 
for the form is an abstract property of 
many objects. 

3c. Projection. Projected form. A 
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geometrical form on one plane which 
is in an exact correspondence with a 
form on another plane, the correspond- 
ence being defined as point-to-point or 
one-to-one. The relation between a 
form and its projection is physically 
exemplified by an object and its shadow 
(silhouette) or an object and its pin- 
hole-image, that is to say by optics. 
If the planes of the two forms are paral- 
lel, the forms are geometrically similar 
or congruent, like scale-drawings or 
plan-views; if the planes are not paral- 
lel, one form is a perspective-transfor- 
mation of the other. 

Conceived thus, a given plane geo- 
metrical form is only one of an infinite 
set of perspective-transformations. The 
physically analogous fact is that when 
an objective surface-form is projected 
on another surface by light, the differing 
orientation of the form to the surface 
yields a set of different perspective- 
forms. The psychologically analogous 
fact, one might suppose, would be that 
when a surface-form is viewed at dif- 
ferent angles of regard, the perceiver 
obtains a set of different perspective- 
impressions. The difficulty for psy- 
chology is that under ordinary circum- 
stances he does not; instead he obtains 
a constant percept of the surface-form 
with a varying impression of slant. 


The Problem of Form-Perception 


The above definitions provide a ter- 
minology which can now be applied. 
Returning to the original question of 
how we perceive form, the obvious re- 
ply must be, what kind of form? The 
question must be divided into three or 
more questions. Solid or surface forms, 
pictorial forms, and geometrical forms 
—these at least must be treated sepa- 
rately. 

Perceiving surface-forms. If the psy- 
chologist can explain how we see a given 
face of a solid object having a certain 
form at a certain slant, the explanation 


of how we see the whole object in three 
dimensions can be derived. This is the 
problem of shape-constancy. It has 
usually been assumed, in thinking about 
shape-constancy, that seeing a form 
without slant was simpler than seeing 
a form with slant. Since the retinal 
image is flat, there is supposed to be a 
retinal form, a two-dimensional form, 
or a “pattern-stimulus,” which initiates 
the process of perception. But what 
could this retinal form possibly be? It 
is clearly neither a substantial form 
nor an abstract geometrical form. Per- 
haps it is a pictorial form—the “retinal 
picture” of commonsense psychology. 
Nothing, however, could be more mis- 
taken. A picture is something to be 
looked at. The retinal image could 
only be a picture if there existed a per- 
ceiver behind the eye to look at it. The 
retinal image is none of the kinds of 
form defined; it is in fact not a form 
at all. It is a complex of variables of 
light-energy, definable in terms of steps 
and gradients but not in terms of physi- 
cal edges, geometrical lines, or graphic 
outline (1). Ordinarily there is a dual 
complex of energy (a pair of images) on 
a bifurcated receptor-surface (a pair of 
retinas). We do not see our retinal im- 
ages; we see an object, and the process 
is mediated by the images. The images 
as such may prove to be definable in 
terms of order (perhaps the kind of 
order exemplified by the number-series) 
iut not form. The writer believes that 
the retinal images should be conceived 
as a kind of “ordinal stimulation” (1, 
Chap. 5).° 

In order to understand the perceiving 


5In connection with the analysis of visual 
images in terms of variables, it is interesting 
to note that closed geometrical forms are 
variables when one considers their perspective 
transformations, and that different kinds of 
geometrical forms (e.g., triangle, square, cir- 
cle) can probably be specified in terms of 
variables, although little mathematical effort 
has been expended in this direction. 
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of a surface-form, therefore, the prob- 
lem is to specify the variables of stimu- 
lation which elicit the perceived proper- 
ties of the object—the slant of the sur- 
face together with the form of its edges. 
Both the surface and its edges are spe- 
cifically given in retinal stimulation but 
they are not represented there. The 
pair of images is a correlate but not a 
copy of the object. The nearly constant 
appearance of these edges at differing 
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slants in the case of ordinary naive per- 
ception will probably prove to be sim- 
ply a by-product of the specifying of 
this stimulation. The dependence of 
phenomenal slant-shape upon the im- 
pression of slant may then be clarified. 

Perceiving objects mediated by pic- 
tures. The question of what and how 
we see when we look at a picture of 
some kind is quite different from the 
question formulated above. The proc- 
ess of object-perception is surely sim- 
pler than the process of picture-percep- 
tion. Despite the fact that men have 
been making drawings for thousands of 
years and during the past century have 
invented a variety of displays which our 
ancestors never dreamed of, we know 
very little about what happens within 
us when we see a picture. This much 
is certain, however. Pictures are on a 
surface whereas substantial objects are 
a surface. Moreover pictures stand for 
substantial objects in addition to being 
substantial objects. An outline-form, a 
painting, a photograph, or a radar 
screen-picture are each a sign of some- 
thing else. What the observer ordi- 
narily perceives is the object, place, or 
event represented, and this fact poses 
a special problem of perception. 
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' Take for example an outline-form, 
i.e., a line drawing on paper. It is al- 
most impossible to avoid seeing the 
properties of a surface-form having 
edges which stand out from the back- 
ground. This “piece of surface” may 
have a form and a slant quite different 
from the form of the tracings and the 
slant of the paper. This last fact is so 
important that an experimental demon- 
stration of it is worth reporting. 
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(5) 


The outline-forms illustrated, drawn 
on cards, were shown in succession to 
an observer with the instructions, “Tell 
me what you see on the card. Keep 
looking at it, and if what you see 
changes describe it also.” For each 
card the verbal descriptions were re- 
corded and later classified. Every O 
reported two or more perceptions within 
60 seconds and some had as many as 
seven. Ten Os were used. 

At no time did any O describe any- 
thing like black deposits or marks or 
traces on a white surface. All the terms 
and phrases used fell into three other 
classes: lines and angles, geometrical 
figures, and solid objects with physical 
surfaces. The first two kinds were very 
infrequent; the great majority of terms 
referred to objects. Evidently what 
every O saw “on the card” was seen 
with what might be called the pictorial 
attitude. The physical objects reported 
were highly variable, differing from one 
O to another and successively for the 
same 0. They could be divided into 
two sub-classes: (1) objects for which 
the outline-form was a plan-view, and 
(2) objects for which it was a per- 
spective-view. With only one or two 
exceptions, every observer saw every 
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outline-jorm in each of these two ways. 
The first drawing could be a horn or 
a road curving up a hill, the second an 
arrow or a tent-roof, the fourth a pulley 
with rope or a cannon-muzzle, the sixth 
a truncated pyramid or a carpet on the 
floor, and so on. It is obvious that a 
single outline-form may elicit percep- 
tions of two quite different solid forms. 
The form of a surface with its edges 
(e.g., the carpet) may be very different 
from the outlines representing the edges 
of such a surface (e.g., the trapezoid). 
It should now be clear why outline 
drawings are not appropriate stimulus- 
objects with which to begin the study 
of form-perception. They are habit- 
ually taken to stand for something other 
than what they are and, more impor- 
tant, what they stand for is often equiv- 
ocal. An outline, representing as it 
does only the edges of a surface, may 
stand for any object which projects that 
particular outline, including some very 
queerly shaped surfaces. For instance, 
a given trapezoid, or a trapezoidal pen- 
cil of light-rays, may stand for a square 
at a given slant but it may also stand 
for any of an infinite set of different 
trapezoids at other different slants. 
Conceiving the matter in this way, it is 
only to be expected that the perceiving 
of outline-forms is fluid, changeable, and 
seems to have a spontaneous character. 
The perceptions they arouse are un- 
stable because they are equivocal rep- 
resentations. It is not necessary to in- 
fer that the stimulus-distribution is 
unstable, nor to suppose that it moves 
in the direction of equilibrium or “good 
form.” Nearly all the research on form- 
perception has utilized outline-drawings 
as stimuli. If it be granted that these 
are actually pictures of forms, the re- 
search is irrelevant to the problem. A 
genuine psychophysics of form-percep- 
tion will have to deal with “shape-slant,” 
i.e., with transformations of form which 
co-vary with degrees of slant. In con- 


WME, beget etn: 


GIBson 


trast with an outline drawing, a pair of 
retinal images of the usual sort contains 
stimulus-variables for the perception of 
slant (2). Only in “peephole situa- 
tions” are these stimulus-variables so 
impoverished that the perception of 
slant becomes ambiguous. 

Visualizing geometrical forms. To 
the question of how we perceive form 
in the third general meaning of the 
term, the answer is probably that geo- 
metrical forms are not perceived at 
all. Geometrical forms have no stimuli 
or, more exactly, are not in psychophys- 
ical correspondence with stimuli. They 
are not seen directly like substantial 
forms, or indirectly like represented 
substantial forms, but instead are con- 
ceived or abstracted from innumerable 
past seeings of both. Not much is 
known about the process of abstraction 
or concept-formation, but it is fairly 
certain that a child can identify a sim- 
ple object at a very early age, a repre- 
sented object at a later age, and a con- 
cept at only a much later age. The 
geometrical forms we are talking about 
are conceptual or general. It must be 
remembered that the geometrical tri- 
angle referred to in a theorem is a tri- 
angle in general, not just the particular 
form in the geometry textbook. 

What kind of forms are the drawings 
in the textbooks then? One might be 
tempted to say they are pictorial forms, 
but this would not be strictly correct. 
Outline-forms do not represent geomet- 
rical forms in the way that they repre- 
sent the edges of surface-forms, or ob- 
jects. A geometrical drawing may be 
said to signify a whole set of projective 
transformations. A set of transforma- 
tions is even more ghostly than a single 
geometrical form. An outline taken in 
this sense is more nearly a symbol than 
it is a picture. The relationship of 
standing for is more dependent on an 
arbitrary convention, and therefore on 
learning, than it is in the case of a pic- 
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ture. Consequently an outline-form 
presented to an observer without any 
other indications of the object repre- 
sented is even more ambiguous than it 
was made out to be in the last section. 
In addition to being equivocal as a 
picture, it is a symbol for a bevy of 
geometrical ghosts. To assume that 
it constitutes a simple “stimulus” for 
perception is completely misleading. 
Patterns and textures as distinguished 
from forms. The term pattern has so 
far neither been used nor defined. In 
psychological usage it seems to refer 
to (1) a group or arrangement of single 
objects, or (2) a group of artificial 
traces on a surface, such as the patterns 
used to exemplify Wertheimer’s laws of 
visual organization, or (3) a group of 
natural inhomogeneities on a surface. 
In the latter meaning, a pattern passes 
over into being a texture. The latter 


term, in the writer’s opinion, should 
mean an arrangement of visible parti- 
cles not on but in a surface—the visible 


structure of a surface itself. This is 
important because of the possibility 
that the optically corresponding texture 
of the retinal image of the surface is 
the adequate stimulus-condition (or one 
of the stimulus-conditions) for the im- 
pression of the surface (2). The writer 
has suggested that the impression of a 
surface is essential for the perception 
of determinate visual space (1). The 
interrelationships between visual acuity, 
visual texture, surface-perception, and 
space-perception remain to be defined. 

The first and second meanings of the 
term pattern—a grouping or arrange- 
ment—need definition and analysis as 
much as any of the others, but the task 
is beyond the scope of this paper. So 
also do “structure,” “sequence” and the 
higher order varieties of Gestalten. The 
feeling of mystery that attaches to all 
such words ought to be dispelled, be- 
cause they will be even more interesting 
when comprehended. 
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Conclusions. A number of explicit 
definitions of visual forms have been 
proposed out of a conviction that psy- 


.chologists should come down to earth 


and say exactly what they mean when 
they talk about form. The suggestion 
was made that the kind of forms here- 
tofore studied—pen or pencil tracings— 
is artificial, and as a type of stimulus 
for perception is equivocal. Such forms 
are either projections of disembodied 
edges or symbols for ghostly abstrac- 
tions. The forms we need to investigate 
first are embodied in chunks of physical 
surface. The kind of form for which 
there exists an unequivocal stimulus is 
a form imbedded in a surface—that is 
to say, a shape-at-a-slant. When a 
form is not imbedded in a surface (and 
when, as a result, contour-stimulation 
is not accompanied by surface-stimula- 
tion) the resulting percept is ambigu- 
ous. Since the slant of the presumed 
surface is equivocal, the form is also 
equivocal, and what the observer sees is 
open to the influence of assumed proba- 
bilities, clues, unconscious inferences, the 
standards of past experience, or the so- 
cial norms of group life. Since nearly all 
the experimental research on form-per- 
ception has been performed with outline- 
stimulation alone, we are tempted to 
conclude that all form-perception de- 
pends on probabilities, inferences, and 
norms—in other words, on subjective 
factors. This conclusion is unwarranted. 
Important as these factors no doubt are, 
the primary problem for psychologists 
is to isolate the invariant properties in 
visual stimulation which are in psycho- 
physical correspondence with constant 
phenomenal objects. According to the 
proposed definitions solid forms and 
surface-forms are realities. Outline- 
forms and also pictorial-, plan-, perspec- 
tive- and nonsense-forms are representa- 
tions which the perceiver takes to stand 
for realities. For these a special theory 
of picture-perception is required. Geo- 
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metrical forms, both plane and solid, are _ 


abstractions which cannot even be rep- 
resented, strictly speaking, but can only 
be specified by symbols. If the defini- 
tions are accepted, there is no such thing 
as form-in-general with the universal 
characteristics ascribed to it by Gestalt 
theorists. None of the above forms is 
a whole which is different from its parts. 
None is organized in any special sense. 
None is in the least dynamic. It is pos- 
sible to understand, however, why these 
characteristics have been ascribed to 
visual Gestalten. The reason is prob- 
ably that we have studied only the dis- 
embodied varieties of form—.e., ghost 
shapes—which are ambiguous represen- 
tations or equivocal symbols, and which 
consequently yield fluid, variable, or 
inconsistent percepts. 

If such be the case, the effort to de- 
termine what happens in the brain when 
one perceives form-in-general will prove 
to be fruitless. Theories such as those 
of Kohler (5, 6) and, more recently, 
Hebb (3) seem to be efforts of this sort. 
At least three separate levels of theory 
will be required: first, a theory of how 
we perceive the surfaces of objects—a 
theory of slant-shape or, in older words, 
of shape-constancy; second, a theory of 
how we perceive representations, pic- 
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tures, displays, and diagrams; and 


third, a theory of how we apprehend 
symbols. There is no reason to sup- 
pose that the physiological concomi- 
tants of all these experiences will be 
the same; in fact, since pictures and 
symbols presuppose objects, their physi- 
ological explanations will probably have 
to ve found at increasing levels of com 
plexity. When these three levels of 
theory have been developed, the cate- 
gory of “form-perception” in psychol- 
ogy will have evaporated. 
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A MODEL FOR STIMULUS GENERALIZATION 
AND DISCRIMINATION 


BY ROBERT R. BUSH! AND FREDERICK MOSTELLER 
Harvard University* 


INTRODUCTION 


The processes of stimulus generali- 
zation and discrimination seem as fun- 
damental to behavior theory as the 
simple mechanisms of reinforcement 
and extinction are to learning theory. 
Whether or not this distinction be- 
tween learning and behavior is a useful 
one, there can be little doubt that few 
if any applications of behavior theory 
to practical problems can be made 
without a clear exposition of the phe- 
nomena of generalization and discrimi- 
nation. It is our impression that few 
crucial experiments in this area have 
been reported compared with the num- 
ber of important experiments on simple 
conditioning and extinction. Perhaps 


part of the reason for this is that there 
are too few theoretical formulations 


available. That is to say, we con- 
ceive that explicit and quantitative 
theoretical structures are useful in 
guiding the direction of experimental 
research and in suggesting the type of 
data which are needed. 

In this paper we describe a model, 
based upon elementary concepts of 
mathematical set theory. This model 
provides one possible framework for 
analyzing problems in stimulus gen- 


1SSRC-NRC Post-doctoral Fellow in the 
Natural and Social Sciences. 

? This research was supported by the Lab- 
oratory of Social Relations, Harvard Univer- 
sity, as part of a program of the Laboratory's 
Project on Mathematical Models. 

We are indebted to many persons for as- 
sistance and encouragement, but in particular 
to F. R. Brush, C. I. Hovland, K. C. Mont- 
gomery, F. D. Sheffield, and R. L. Solomon. 
We are also grateful to W. K. Estes for send- 
ing us a pre-publication copy of his paper(2). 


eralization and discrimination. Fur- 
ther, we shall show how this model 
generates the basic postulates of our 
previous work on acquisition and ex- 
tinction (1), where the stimulus situa- 
tion as defined by the experimenter 
was assumed constant. 

Stated in the simplest terms, gener- 
alization is the phenomenon in which 
an increase in strength of a response 
learned in one stimulus situation im- 
plies an increase in strength of response 
in a somewhat different stimulus sit- 
uation. When this occurs, the two 
situations are said to be similar. AlJ- 
though there are several intuitive 
notions as to what is meant by “simi- 
larity,’’ one usually means the proper- 
ties which give rise to generalization. 
We see no alternative to using the 
amount of generalization as an opera- 
tional definition of degree of “‘simi- 
larity.” In the model, however, we 
shall give another definition of the 
degree of similarity, but this definition 
will be entirely consistent with the 
above-mentioned operational defini- 
tion. 

We also wish to clarify what we 
mean by stimulus discrimination. In 
one sense of the term, all learning is a 
process of discrimination. Our usage 
of the term is a more restricted one, 
however. We refer specifically to the 
process by which an animal learns to 
make response A in one stimulus situ- 
ation and response B (or response A 
with different “strength”’) in a differ- 
ent stimulus situation. We are not 
at the moment concerned with, for 
example, the process by which an 
animal learns to discriminate between 
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various possible responses in a fixed 
stimulus situation. ‘ ‘ 
As prototypes of the more general 
problems of stimulus generalization 
and discrimination, we shall consider 
the following two kinds of experiments: 


(i) An animal is trained to make a 
particular response, by the usual rein- 
forcement procedure, in an experimen- 
tally defined stimulus situation. At the 
end of training, the response has a certain 
strength or probability of occurrence. 
The animal is then “tested” in a new 
stimulus situation similar to the training 
one and in which the same response, 
insofar as it is experimentally defined, 
is possible. One then asks about the 
strength or probability of occurrence of 
the response in this new stimulus situa- 
tion and how it depends on the degree of 
similarity of the new situation to the old 
stimulus situation. 

(ii) An animal is presented alternately 
with two stimulus situations which are 
similar. In one, an experimentally de- 


fined response is rewarded, and in the 


other that response is either not re- 
warded or rewarded less than in the first. 
Through the process of ~eneralization, 
the effects of rewards and non-rewards 
in one stimulus situation influence the 
response strength in the other, but even- 
tually the animal learns to respond in 
one but not in the other, or at least to 
respond with different probabilities (rates 
or strengths). One then asks how the 
probability of the response in each situa- 
tion varies with the number of training 
trials, with the degree of similarity of the 
two situations, and with the amount of 
reward. 


We do not consider that these two 
kinds of experiments come close to ex- 
hausting the problems classified under 
the heading of generalization and dis- 
crimination, but we do believe that 
they are fundamental. Thus, the 
model to be described has been de- 
signed to permit analysis of these 
experiments. In the next section we 
will present the major features of the 
model, and in later sections we shall 





apply it to the above described ex- 
periments. 
THE MoDEL 


We shall employ some of the ele- 
mentary notions of mathematical set 
theory to define our model. A par- 
ticular stimulus situation, such as an 
experimental box with specific prop- 
erties (geometrical, optical, acoustical, 
etc.) is regarded as separate and dis- 
tinct from the rest of the universe. 
Thus, we shall denote this situation 
by a set of stimuli which is part of the 
entire universe of stimuli. The ele- 
ments of this set are undefined and we 
place no restriction on their number. 
This lack of definition of the stimulus 
elements does not give rise to any 
serious difficulties since our final results 
involve neither properties of individual 
elements nor numbers of such ele- 
ments. We next introduce the notion 
of the measureofaset. If the set con- 
sists of a finite number of elements, we 
may associate with each element a posi- 
tive number to denote its “weight”; 
the measure of such a set is the sum of 
all these numbers. Intuitively, the 
weight associated with an element is 
the measure of the potential impor- 
tance of that element in influencing 
the organism’s behavior. More gen- 
erally, we can define a density function 
over the set; the measure is the inte- 
gral of that function over the set. 

To bridge the gap between stimuli 
and responses, we shall borrow some . 
of the basic notions of Estes (2). 
(The concept of reinforcement will 
play an integral role, however.) It is 
assumed that stimulus elements exist 
in one of two states as far as the 
organism involved is concerned; since 
the elements are undefined, these states 
do not require definition but merely 
need labelling. However, we shall 
speak of elements which are in one 
state as being “conditioned” to the 
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response, and of elements in the other 
state as being “‘non-conditioned.” 

On a particular trial or occurrence 
of a response in the learning process, 
it is conceived that an organism per- 
ceives a sub-set of the total stimuli 
available. It is postulated that the 
probability of occurrence of the re- 
sponse in a given time interval is equal 
to the measure of the elements in the 
sub-set which had been previously con- 
ditioned, divided by the measure of 
the entire sub-set. Speaking roughly, 
the probability is the ratio of the im- 
portance of the conditioned elements 
perceived to the importance of all the 
elements perceived. It is further as- 
sumed that the sub-set perceived is 
conditioned to the response if that 
response is rewarded. 

The situation is illustrated in Fig. 1. 
It would be wrong to suppose that the 
conditioned and non-conditioned ele- 
ments are spatially separated in the 
actual situation as Fig. 1 might sug- 
gest; the conditioned elements are 
spread out smoothly among the non- 
conditioned ones. In_ set-theoretic 
notation, we then have for the proba- 
bility of occurrence of the response 


m(XM C) 
i iat 


where m( ) denotes the measure of 
any set or sub-set named between the 
parentheses, and where XMC indi- 
cates the intersection of X and C (also 
called set-product, meet, or overlap of 
XandC). We then make an assump- 
tion of equal proportions in the meas- 
ures so that 


m(X1 C) 
P= m(X) 


p= 


m(C) 


=m 





Heuristicaily, this assumption of 
equal proportions can arise from a 


fluid model. Suppose that the total 
situation is represented by a vessel 
containing an ideal fluid which is a 
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Fic. 1. Set diagram of the single stimulus 
situation S with the various sub-sets involved 
in a particular trial. C is the sub-set of 
elements previously conditioned, X the sub- 
set of S perceived on the trial. The sub-sets 
A and B are defined in the text. 


mixture of two substances which do 
not chemically interact but are com- 
pletely miscible. For discussion let 
the substances be water and alcohol 
and assume, contrary to fact, that the 
volume of the mixture is equal to the 
sum of the partial volumes. The vol- 
ume of the water corresponds to the 
measure of the sub-set of non-condi- 
tioned stimuli, S — C (total set minus 
the conditioned set), and the volume 
of the alcohol corresponds to the 
measure of the sub-set C of condi- 
tioned stimuli. The sub-set X corre- 
sponds to a thimbleful of the mixture 
and of course if the fluids are well 
mixed, the volumetric fraction of alco- 
hol in a thimbleful will be much the 
same as that in the whole vessel. 
Thus the fraction of measure of con- 
ditioned stimuli in X will be equal to 
the fraction in the whole set S, as 
expressed by equation (2). Our defi- 
nition of p is essentially that of Estes 
(2) except that where he speaks of 
number of elements, we speak of the 
measure of the elements. 

We next consider another stimulus 
situation which we denote by a set S’. 
In general this new set S’ will not be 
disjunct from the set S, i.e., S and S’ 
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will intersect or overlap as shown in 
Fig. 2. Wedenote the intersection by 


IT=SnS’. (3) 


We can now define an index of 
similarity of S’ to S by 


1S’ to 8) = @O. 


(4) 


In words this definition says that the 
index of similarity of S’ to S is the 
measure of their intersection divided 
by the measure of the set S’. (Our 
notation makes clear that we have 
made a tacit assumption that the 
measure of an element or set of ele- 
ments is independent of the set in 
which it is measured.) Definition (4) 
also gives the index of similarity of 
S to S’ as 
_ mI) 
a(S to S’) m(S) 
= MS) 55° to S). 


m( 


(5) 


From this iast equation it is clear that 
the similarity of S’ to S may not be 
the same as the similarity of S to S’. 
In fact, if the measure of the inter- 
section is not zero, the two indices are 
equal only if the measures of S and S’ 
are equal. It seems regrettable that 
similarity, by our definition, is non- 
symmetric. However, we do not care 
to make the general assumption that 
(a) the measures of all situations are 
equal and at the same time make the 
assumption that (b) measures of an 
element or set of elements is the same 
in each situation in which it appears. 
For then the importance of a set of 
elements, say a light bulb, would have 
to be the same in a small situation, 
say a 2’ X 2’ X 2’ box, as in a large 
situation, say a ballroom. Further 
this pair of assumptions, (a) and (b), 
leads to conceptual difficulties. 
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THE GENERALIZATION PROBLEM 


We are now in a position to say 
something about the first experimental 
problem described in the Introduction. 
An animal is trained to make a re- 
sponse in one stimulus situation and 
then his response strength is measured 
in a similar situation. After the ani- 
mai has been trained in the first situa- 
tion whose elements form the set S, a 
sub-set C of S will have been condi- 
tioned to the response as shown in 
Fig. 2. But part of the sub-set C is 
also contained in the second situation 
whose elements form the set S’; we 
denote this part by Cn S’. 

From the discussion preceding equa- 
tions (1) and (2), we can easily see 
that the probability of the response 
occurring in S’ is 

m(Cr S’) 
~ mS!) 
We now use the assumption of equal 
proportions so that 

m(Cr S’) et m(Cr I) us m(C) 
m(I) m(I) m(S) ° 
The first equality in this equation 


follows from the fact that the only 
part of C which is in S’ is in the inter- 


S S 


, 


(6) 





(7) 


S 

Fic. 2. Diagram of two similar stimulus 
situations after conditioning in one of them. 
The situation in which training occurred is 
denoted by the set S; the sub-set C of S 
represents the portion of S which was con- 
ditioned to the response. The new stimulus 
situation in which the response strength is to 
be measured is represented by the set 5’, 
and the intersection of S’ and S is denoted by 
I. 
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section J as shown in Fig. 2. The 
second equality in equation (7) is an 
application of our assumption that the 
measure of C is uniformly distributed 
over S and so the intersection contains 
the same fraction of measure of C as 
does the entire set S. 

If now we combine equations (6) 
and (7), we obtain 


,_ m1) m(C) 
mS) ms) © 


From equation (4) we note that the 
first ratio in equation (8) is the index 
of similarity of S’ to S, while from 
equation (2) we observe that the sec- 
ond ratio in equation (8) is merely 
the probability p of the response in S. 


Hence 
pb’ = 4(S’ to S)p. (9) 


Equation (9) now provides us with 
the necessary operational definition of 
the index of similarity, 7(S’ to S), of 
the set S’ to the set S. The proba- 
bilities p and p’ of the response in S 
and S’, respectively, can be measured 
either directly or through measure- 
ments of latent time or rate of re- 
sponding (1). Therefore, with equa- 
tion (9), we have an operational way 
of determining the index of similarity. 

As a direct consequence of our as- 
sumption of equal proportions, we can 
draw the following general conclusion. 
Any change made in a stimulus situa- 
tion where a response was conditioned 
will reduce the probability of occurrence 
of that response, provided the change 
does not introduce stimuli which had 
been previously conditioned to that re- 
sponse. This conclusion follows from 
equation (9) and the fact that we have 
defined our similarity index in such a 
way that it is never greater than unity. 

A word needs to be said about the 
correspondence between our result and 
the experimental results such as those 
of Hovland (3). Our model predicts 
nothing about the relation of the index 
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of similarity defined above to such 
physical dimensions as light or sound 
intensity, frequency,etc. In fact, our 
model suggests that no such general 
relation is possible, i.e., that any sen- 
sible measure of similarity is very 
much organism determined. There- 
fore, from the point of view of our 
model, experiments such as those of 
Hovland serve only as a clear demon- 
stration that stimulus generalization 
exists. In addition, of course, such 
experiments provide empirical rela- 
tions, characteristic of the organism 
studied, between the proposed index of 
similarity and various physical dimen- 
sions, but these relations are outside 
the scope of our model. 

We conclude, therefore, that our 
model up to this point has made 
no quantitative predictions about the 
shape of generalization gradients which 
can be compared with experiment. 
Nevertheless, the preceding analysis 
of generalization does provide us with 
a framework to discuss experiments on 
stimulus discrimination. In the fol- 
lowing sections we shall extend our 
model sc as to permit analysis of such 
expeviments. 


THE REINFORCEMENT AND Ex- 
TINCTION OPERATORS 


In this section we develop some re- 
sults that will be used later and show 
that the model of the present paper 
generates postulates used in our pre- 
vious paper (1). We shall examine 
the step-wise change in probability of 
a response in a single stimulus situa- 
tion S. We generalize the notions 
already presented as follows: Previous 
to a particular trial or occurrence of 
the response, a sub-set C of S will 
have been conditioned. On the trial 
in question a sub-set X of S will be 
perceived as shown in Fig. 1. Ac- 
cording to our previous assumntions, 
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the probability of the response is 


m(XAC) _ m(C) 
P= m(X) ~ m(S)" 


We now assume that a sub-set A of X 
will be conditioned to the response as 
a result of the reward given and that 
the measure of A will depend on the 
amount of reward, on the strength of 
motivation, etc. We further assume 
that another sub-set B of X will be- 
come non-conditioned as a result of 
the work required in making the re- 
sponse. For simplicity we assume 
that A and Bare disjunct. (The error 
resulting from this last assumption 
can be shown to be small if the meas- 
ures of A and B are small compared 
to that of S.) 

We extend our assumption of equal 
proportions so that we have 


m(ArC) “ m(Br C) Ms m(C) 
m(A) m(B) m(S) ° 


Now at the end of the trial being con- 
sidered, sub-set A is part of the new 
conditional sub-set while sub-set B is 
part of the new non-conditioned sub- 
set. Thus, the change in the measure 
of C is 


Am(C) = [m(A) — m(AnC)] 
—m(BNC) (12) 
= m(A)(1 — p) — m(B)p. 


This last form of writing equation (12) 
results from the equalities given in 
equations (10) and (11). if we then 
let 





(10) 


(11) 





_m(A) , _ m(B) 

m(S)’ m(S)’ 

and divide equation (12) through by 

m(S), we have finally for the change 

in probability: 
Am(C) 
m(S) 


We thus define a mathematical oper- 
ator Q which when applied to p gives 


(13) 





Ap = = a(l— p) — bp. (14) 


a new value of probability Qp effective 
at the start of the next trial: 


Qp=pt+a(i—p)— dp. (15) 


This operator is identical to the gen- 
eral operator postulated in our model 
for acquisition and extinction in a fixed 
stimulus situation (1). Hence, the 
set-theoretic model we have presented 
generates the basic postulates of our 
previous model which we applied to 
other types of learning problems (1). 
When the operator Q is applied n times 
to an initial probability po, we obtain 


O*Po = Pa = Pu — (Pa — Pog", (16) 


where p. = a/(a+5) and g=1—a—b. 

In the next section we shall apply 
these results to the experiment on 
stimulus discrimination described in 
the Introduction. 


THE DISCRIMINATION PROBLEM 


We are now in.a position to treat 
the second experimental problern de- 
scribed in the Intreduction. An ani- 
mal is presented alternately with two 
stimulus situations S and S’ which are 
similar, i.e., which have a non-zero 





Fic. 3. Set diagram for discrimination 
training in two similar stimulus situations, S 
and S’. The various disjunct sub-sets are 
numbered. Set S includes 1, 3, 5, and 6; 
S’ includes 2, 4, 5, and 6. The intersection J 
is denoted by 5 and 6. 7, the complement 
of J in S, is shown by 1 an@ 3; 7’, the com- 
plement of J in S’,isshow ~ 2and4. C, 
the conditioned sub-set in S, is represented by 
3 and 6, while the conditioned sub-set in 5S’, 
is represented by 4and 6. 7, is denoted by 3, 
T.' by 4, and J, by 6. 
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intersection. The rewards which fol- 
low occurrences of the response are 
different for the two situations, and 
we are interested in how the response 
strengths vary with training. At any 
point in the process, sub-sets of S 
and S’ will be conditioned to the re- 
sponse as shown in Fig. 3. We shall 
distinguish between that part of S 
which is also in S’ and that part 
which is not by letting J = Sn S’ and 
T = S— (SNS’) = S—TI. Wealso 
distinguish between the part of the 
conditioned sub-set C of S which is in 
I and that which is in T, by letting 
I,.= Cal and T,.=C-—(CnD=TAC. 
The probability of the response in S is 


sa m(C) _ m(T.) + m(I.) 
m(S) m(S) ; 
Then we let 
m(T-.) m(I,) 
“ m(T) " m(I) ' 


and, abbreviating 4(S to S’) with n, we 
may write (17) in the form 


pb = a(1 — ) + Bn. (20) 


We write the probability of the re- 
sponse in this form because we shall 
soon argue that the index 9 varies 
during discrimination training. First, 
however, we shall investigate the vari- 
ation of a and 6 with the number of 
training trials. From the definitions 
of a and £,.equations (18) and (19), 
we see that these variables are very 
much like our probability » of equa- 
tion (17) except that they refer to 
sub-sets of S rather than to the entire 
set. By strict analogy with the argu- 
ments in the last section, we conclude 
that 


Gn = Qa = aa — (ae — a)g”, (21) 


where a. = a/(a+5) and g=1—a—d. 
Now, 8, the fraction of conditioned 
stimuli in the intersection J, changes 
with each presentation of S’ as well as 





(17) 


(18) B= (19) 
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of S. Thus, for each presentation of 
S, we must operate on 8 twice, once 
by our operator Q which describes the 
effect of the environmental events in 
S, and once by an analogous operator 
Q’ which describes the effect of the 
events in S’. Hence, it may be shown 
that 


8, = (Q’Q)"Bo = Bx — (Ba — Bo) f", (22) 


where 

a’ +a(i—a’ — d’) 
Ba = fa’ +a(1 —a' —d’) 

+ b’ + b(1 — a’ — d’)} 
and where f = (1—a’ —b’)(1—a—b). 

It should be stressed that we are 
assuming that the response occurs and 
is rewarded to the same degree on 
every presentation of S. The same 
statement, mutatis mutandis, applies 
to S’. Without this assumption, we 
are not justified in applying the oper- 
ators Q and Q’ for each presentation. 
The probability is then the probability 
that the response will occur in an 
interval of time, h. The operational 
measure of this probability is the mean 
latent time, which according to the 
response model discussed earlier varies 
inversely as the probability (1). 

We now have cleared the way for 
discussing the central feature of our 
model for discrimination problems. 
We conceive that the measure of the 
intersection I of the two sets S and S’ 
decreases as discrimination learning 
progresses. This concept seems tc 
make sense intuitively since the meas- 
ure of any sub-set of stimuli indicates 
the importance of that sub-set in in- 
fluencing behavior. If an animal is 
rewarded for a response in S but not 
rewarded for it in S’, then the stimuli 
in J are unreliable for deciding whether 
or not to make the response. And it 
is just this ambiguity which causes 
the measure of the intersection to 
decrease with training. We shall de- 
scribe this change by introducing a 





» (23) 
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“discrimination operator,’’ denoted by 
D, which operates on the similarity 
index 9 each time the environmental 
event following the response changes 
from one type of event to another, 
e.g., from reward to non-reward. In 
the present problem, we are consider- 
ing alternate presentations of S and S’ 
and thus alternate occurrences of the 
events associated with the operators 
Q and Q’. So if n; is the ratio of the 
measure of J to that of S after the 
ith presentation of S, the ratio after 
the (¢ + 1)th presentation is 


nin = Dn. (24) 


Our next task is to postulate the form 
of the operator D. 

We find that neither experimental 
data nor our intuition is of much help 
in guiding our choice of such a postu- 
late. For mathematical simplicity we 
choose an operator which represents a 
linear transformation on 7. More- 
over, we wish to have an operator 
which always decreases 9 (or holds it 
fixed), but which will never lead to 
negative values of 7. Therefore, we 
postulate that 


Dn = kn, (25) 


where k& is a new parameter which is 
in the range between zero and 1. We 
then have 


in = Dn ase kno. (26) 


Combining equations (20), (21), (22), 
and (26), we have 


Pn = Q"a0(1 — Dm) + (Q’Q)"BoD"no 
= [ate — (ae — ao) g” ](1 — b*no) 
+ [Be — (Bo — Bo) f"Jk"mo. (27) 


This is our final expression for the 
variation of ~,, the probability of the 
response in situation S, as a function 
of the trial number ». This equation 
is composed of two major terms. The 
first term corresponds to the relative 
measures of the stimulus elements of 
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S which are not in S’ (the measure of 
T. divided by the measure of 5S). 
The second term corresponds to the 
relative measure of the elements in 
the intersection of S and S’ (the meas- 
ure of J, divided by the measure of S). 

Because of the symmetry between 
S and S’, we may write for the proba- 
bility in S’: 


Pa’ = [ae — (an — a’) g’* }(1—"m0’) 
+[Be— (Bx —Bo)f" jem’, (28) 


where a,’ = a’/(a’ + 6’), and g’ = 1 
— a’ — b’, and where m’ is the initial 
value of 


1’ = 9(S’ to S) 
_ md) _ m(S) 
m(S’)  m(S") » 


We shall now consider some special 
examples for which certain simplifying 
assumptions can be made. 

(a) No conditioning before discrimi- 
nation training. If no previous con- 
ditioning took place in either S or S’, 
it seems reasonable to assume that 
the “operant” levels of performance 
in the two situations are the same. 
Moreover, in view of our assumptions 
of equal proportions, we may assume 
that initially: 


m(C) pi m(T-) 2 m (I) 
m(S) m(T)~ m(I) 
m(T.)  m(C’) 
=‘m(T") mS’) * 


Hence, from equations (17), (18), and 
(19), we have po = ao = ay = fo. 
Moreover, inspection of equation (27) 
shows that, except when k = 1, we 
have ~. = a, and in like manner 
from equation (28) for k # 1, we have 
Po’ = aq’. In Fig. 4 we have plotted 
equations (27) and (28) with the above 
assumptions. The values a = 0.12, 
b = 0.03, po = 0.05, m = m’ = 0.50, 
k = 0.95 were chosen for these calcu- 
lations. As can be seen, the proba- 
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Fic. 4. Curves of probability, » (in S), 
and p’ (in S’), versus trial number, n, for 
discrimination training without previous con- 
ditioning. It was assumed that the response 
was rewarded in S but not rewarded in S’. 
Equation (27), equation (28), and the values 
po = po’ = 0.05, a = 0.12, a’ = 0, b= bd’ 
= 0.03, 70 = m’ = 0.50, and k = 0.95 were 
used. 


bility of the response in S is a mono- 
tonically increasing, negatively accel- 
erated function of the trial number, 
while the probability in 5S’ first in- 
creases due to generalization, but then 
decreases to zero as the discrimination 











Fic. 5. Reciprocals of probability, p, of 
the response in S, and p’, of the response in S’, 
versus trial number, m, for discrimination 
training without previous conditioning. In 
the model described earlier (1), mean latent 
time is proportional to the reciprocal of prob- 
ability. The curves were plotted from the 
values of probability shown in Fig. 4. 
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is learned. These curves describe the 
general sort of result obtained by 
Woodbury for auditory discrimination 
in dogs (4). 

We have argued (1) that the mean 
latent time varies inversely as the 
probability. Thus in Fig. 5 we have 
plotted the reciprocals of , and pp’ 
given in Fig. 4. These curves exhibit 
the same general property of the ex- 
perimental curves on running time of 
rats obtained by Raben (5). 

(b) Complete conditioning in S before 
discrimination training. Another spe- 











Fic. 6. Curves of probability, », and its 
reciprocal versus trial number, , for the case 
of complete conditioning in S before the dis- 
crimination training. Equation (27) with the 
values ~. = 1, Bo = 0, m = 0.80, & = 0.90, 
and f = 0.50 were used. 5 


cial case of interest is that in which 
the set S is completely conditioned to 
the response before the discrimination 
experiment is performed. In this case, 
a = Bo = Po = Po. In Fig. 6 we have 
plotted p, and 1/p, with these condi- 
tions and the values p,. = 1, B. = 0, 
n = 0.80, k = 0.90, and f = 0.50. 
The curve of 1/p versus m is similar 
in shape to the experimental latency 
curve obtained by Solomon (6) from 
a jumping experiment with rats. 

(c) Limiting case of S and S' identi- 
cal. Another limiting case of the kind 
of discrimination experiment being 
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considered here obtains when we make 
the two stimulus situations S and S’ 
identical.. The problem degenerates 
into one type of partial reinforcement 
where, for example, an animal is re- 
warded on every second trial in a fixed 
stimulus situation. The intersection 
I of S and S’ is of course identical to 
both S and S’. Thus the measure of 
I must equal the measure of S. From 
equation (5), we have 
m(I) 


=—~=1, 

7” m(S) ~ 
while according to our postulate about 
the operator D, equation (26), the 
similarity index varies from trial to 


trial: 
a = k ™No- (32) 


For S and S’ identical, the above two 
equations are incompatible, unless we 
take k = 1. Thus, we are forced to 
assume that k depends on how many 
cues are available for discrimination 
in such a way that k = 1 when none 
are available. Moreover, since J and 
S are identical, the mieasure of T, the 
complement of J in S, must be zero. 
Since T, is a sub-set of T, the measure 
of T. must also be zero. Therefore, 
equations (17) and (19) give in place 
of equation (20) 


b = Bn. (33) 


But we have just argued that for S 
and S’ identical, we haven = 1. Thus 


(31) 


p=8. (34) 
Equation (22) gives us then 
Pu = (Q'Q)"bo = Pa — (Pa — po) f*. (35) 


This equation agrees with our previous 
result on partial reinforcement (1). 
(d) Irregular presentations of S and 
S’. In most experiments, S and 5S’ 
are not presented alternately, but in 
an irregular sequence so that the ani- 
mal cannot learn to discriminate on 
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the basis of temporal order. A simple 
generalization of the above analysis 
will handle the problem. The usual 
procedure is to select a block of (7 + j’) 
trials during which S i is presented j . 
times and S’ presented j’ times. The 
actual sequence is determined by draw- 
ing “S balls” and “‘S’ balls’’ at random 
from an urn containing j “S balls” 
and j’ “S’ balls.” This sequence is 
then repeated throughout training. 
In our model, we can describe the 
effects on the probability of a known 
sequence by an appropriate applica- 
tion of our operators Q, Q’, and D for 
presentations of S, presentations of 
S’, and shifts from one to the other, 
respectively. A less cumbersome 
method provides a reasonable approxi- 
mation: for each block of (j + 7’) 
trials we describe an effective or ex- 
pected new value of probability by 
applying Q to its operand j times, Q’ 
to its operand j’ times, and D to the 
index » a number of times determined 
by the mean number of shifts from S 
to S’. For the special case of j = 7’, 
the mean number of shifts is 7. Since 
previously, we applied D to 9 for each 
pair of shifts, we write for the (i+1)th 
block of (27) trials 


Pixs = Qiai(1 — Diy.) 
+ (Q’Q)8:Di"y;. (36) 


The rest of the analysis exactly par- 
allels that given above for the case of 


‘alternate presentations of S and S’. 


The results will be identical except 
for the value of & involved in the 
operator D. 

SUMMARY 


A mathematical model for stimulus 
generalization and discrimination is 
described in terms of simple set-theo- 
retic concepts. An index of similarity 
is defined in terms of the model but is 
related to measurements in generaliza- 
tion experiments. The mathematical 
operators for acquisition and extinc- 
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tion, discussed in an earlier paper (1), 
are derived from the set-theoretic 
model presented here. The model is 
finally applied to the analysis of ex- 
periments on stimulus discrimination. 


[MS. received October 13, 1950] 
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PERCEPTION, ASSOCIATION, AND REASONING 
IN ANIMAL FIXATIONS 


BY AL EGLASH 
University of Michigan 


INTRODUCTION 


Maier has summarized the work he 
and his students have done in producing 
abnormal fixations in animals, defining 
abnormal fixation as “the persistence of 
an unadaptive response . . . when an 
adaptive one is possible” (6, p. 33). 
Mowrer has referred tu the problem of 
explaining such behavior at the human 
level as “the absolutely central and 
most critical issue in clinical theory 
. . . why so-called neurotic behavior 
is at one and the same time self-defeat- 
ing and yet self-perpetuating, instead of 
self-eliminating” (10, p. 434). 

What answers have been offered to 
meet this issue? At the human level 
neurotic behavior has been explained 
by various personality theories, includ- 
ing the varieties of psychoanalytic the- 
ory reviewed by Mullahy (11). At- 
tempts to explain repetitive, self-defeat- 
ing behavior at the animal level are 
usually integrations of “learning theory 
and the neurotic paradox,” as Mowrer 
(9) phrases it. 

Theories of animal fixations may be 
pictured as forming a double-Y maze. 
The first fork is the dichotomy of those 
explanations which postulate a change 
in the motivational state of the animal, 
and those which do not. Along the 
former path is a second forking which 


1 This article grew out of discussions be- 
tween Miss Victoria Lewicki, Mr. Paul Ellen, 
and myself. Without their stimulating sug- 
gestions, the paper could not have been writ- 
ten. After it had been completed, Dr. N. R. 
F. Maier was kind enough to read it and to 
criticize the presentation. This does not mean 
that either Miss Lewicki, Mr. Ellen, or Dr. 
Maier are in agreement with the ideas pre- 
sented here. 


Farber has described (1, pp. 113-114). 
This is the dichotomy of anxiety-reduc- 
tion theory, espoused by Farber, Mow- 
rer (8), and Miller (7), and Maier’s 
frustration theory. The anxiety-reduc- 
tion view is that the fixated animal’s 
goal is to reduce anxiety, while Maier, 
on the other hand, offers the more ex- 
treme view that “frustration-instigated” 
behavicr is “behavior without a goal.” 
Both theories agree that sometimes dur- 
ing the experimental procedure the ani- 
mal loses sight of its original goal of 
avoiding punishment. 

Is this a necessary assumption? Must 
we choose between these two theories, 
the one positing that the goal changes 
to anxiety-reduction, the other that the 
goal disappears altogether? Or can we 
accept the animal’s motivation as a con- 
stant throughout the experimental pro- 
cedure, and assert that even the fixated 
animal has as its goal the avoidance of 
the locked window? 

To assert this is to backtrack to the 
first Y of our maze, where we found 
this one path assuming a motivational 
change, and to take the other branch, 
in which motivation is assumed to be 
a constant, and the fixation is explained 
in terms of cognitive difficulty. This is 
a path suggested by Hilgard (2, pp. 
340-341). Only if this cognitive alley 
ends in a cul-de-sac are we justified in 
postulating the more radical assump- 
tion that the animal’s goal changes. 


Procedure Used by Maier 


Placed on the Lashley apparatus, the ani- 
mal must jump from a small stand through 
either of two card-covered windows. If 
the correct card is struck, the animal lands 
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on the feeding platform. If the incorrect 
card is struck, the animal receives a bump 
on the nose from the locked card and falls 
into a net below. If one window is con- 
sistently locked (position-reward problem), 
the animal learns to jump to the other one 
(position-reward response), regardless of 
which card covers it; if one card is consist- 
ently locked (symbol-reward problem), the 
animal learns to jump to the other one 
(symbol-reward response) regardless of 
which window it covers. In the insoluble 
problem the cards are locked randomly, 
and the animal eventually settles down to 
a stereotyped response, jumping consist- 
ently to one window or card. 

Once the animal has successfully learned 
the position-reward response (or, in the 
insoluble problem, developed a position 
stereotype), it is given a soluble problem. 
If the animal, after being given the solv- 
able problem, fails to abandon its position 
response in 200 trials, it has met the crite- 
rion of abnormal fixation. 


Application of perception, association, 
and reasoning (PAR). 


1. Perception: When animals go from 
a situation of 50 per cent random pun- 
ishment (insoluble problem) to a situa- 
tion of 50 per cent orderly punishment 
(symbol-reward problem), they must 
first perceive that the pattern of punish- 
ment has changed. 

2. Association: In the symbol-reward 
problem, four associations are possible: 


(a) Positive card with reward, on 
habitual side. 

(b) Positive card with reward, on 
opposite side. 

(c) Negative card with punish- 
ment, on habitual side. 

(d) Negative card with punish- 
ment, on opposite side. 


3. Reasoning: Although the rat may 
learn through association that the posi- 
tive card will be rewarding and the 
negative card punishing when the cards 
are in the window to which the animal 
habitually jumps, this learning does not 
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necessarily include the knowledge that 
the positive and negative cards will also 
reward and punish, respectively, when 
in the other window. The process by 
which the animal generalizes, hypothe- 
sizes, or infers the pattern of reward 
and punishment at the nonhabitual win- 
dow is what is being termed reasoning, 
because the animal must put two sepa- 
rate experiences together if the proper 
significance is to be discovered (3, p. 
336; 4, p. 375). 


FINDINGS AND INTERPRETATIONS 


1. Comparison of Randomly-Punished 
with Rewarded Animals 


(a) Differential number of fixations. 
The basic experimental design upon 
which frustration theory is based is one 
in which a group of animals is presented 
with the no-solution problem, involving — 
50 per cent random punishment, while 
a control group is given a solvable prob- 
lem. During the final 30 criterion runs 
in this situation, the control group, hav- 
ing solved the problem, is never pun- 
ished. Both groups are then given a 
symbol-reward problem to solve. Maier 
summarizes the results of three experi- 
ments of this kind: 


“". . of the 34 rats that developed posi- 
tion stereotypes in the no-solution problem 
only 10 (29.4 per cent) were able to aban- 
don this response and adopt a symbol- 
reward response in 200 trials or less. How- 
ever, among the 35 rats that acquired a 
position-reward response, 26 (74.3 per 
cent) were able to change to a symbol- 
reward response .. .” (6, p. 32). 


Maier points out that the experimen- 
tal group had the more frustrating ex- 
perience and that this accounts for the 
group differences. However, the experi- 
mental group also had an entirely dif- 
ferent problem to solve, one which was 
more difficult than the problem facing 
the control animals. 

First, the problems differ with respect 
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to the factor of perception. The posi- 
tion-reward animals (control group) 
have been experiencing 100 per cent 
reward for the last 30 jumps. Sud- 
denly they experience 50 per cent or- 
derly punishment. The only discrimi- 
nation they are required to make in 
order to realize that the objective situ- 
ation has changed is between no pun- 
ishment and 50 per cent punishment. 
On the other hand, the randomly-pun- 
ished animals (experimental group), as 
Hilgard (2, pp. 303-304) points out, 
must perceive the more subtle change 
from 50 per cent random punishment 
to 50 per cent orderly punishment. Un- 
less they are able to discriminate this 
difference, they have no reason to drop 
their position stereotype and to try the 
alternate window. The window had 
been tried in the past and found want- 
ing. Why try it again? 

Moreover, perceiving that the objec- 
tive situation has somehow changed is 
not enough. Certain specific assccia- 
tions, which the control group need not 
make, must be built up by the experi- 
mental animals. This is the double 
association of positive card with reward 
and negative card with punishment. 
For the control animals a simple asso- 
ciation of negative card with punish- 
ment is enough. This one association 
tells them that whereas it is still all 
right to jump to their habitual window 
when the positive card is there, they 
had better jump elsewhere when the 
negative card covers it. This “else- 
where” can only be the alternative win- 
dow. 

Finally, the experimental group must 
hypothesize or infer that the same pat- 
tern of reward and punishment which 
now holds at the habitual window will 
also hold at the other window. This 
may seem an obvious, even automatic, 
inference to the experimenter, to whom 
it might never occur to give orderly pun- 
ishment at one window and random pun- 
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ishment at the other. It is an easy 
problem from the experimenter’s point 
of view, not because of greater intelli- 
gence, but because he can peek behind 
the scenes and perceive the pattern 
which the animal (except in guidance, 
as we shall see later) must reason out. 

The experimental group, in short, is 
faced with the more difficult problem. 
In order to master the symbol-reward 
situation, they must solve all three prob- 
lems of PAR. Unless it can be demon- 
strated that the animal has done this, 
any conative theory of fixation is super- 
fluous. Maier, in stating that the ani- 
mal jumps to the habitual window be- 
cause it must, in spite of learning, is 
implying that the animal has indeed 
solved all three problems. Is there any 
evidence that the fixated animal has 
solved any of the three? 

There is. Maier’s evidence for this 
is the next experimental finding to be 
discussed. 

(b) Differential resistance to positive 
and negative cards. Maier offers evi- 
dence suggesting that the animal has 
solved the first two problems. This 
evidence consists in the fact that even 
fixated animals show more resistance to 
jumping whenever the negative card is 
in their habitual window. 

Maier interprets this to mean that 
the animals have learned the new prob- 
lem, but that because of the compulsive 
nature of the fixation, they are unable 
to behave in accordance with this learn- 
ing. On the other hand, it may be that 
although all animals successfully asso- 
ciate the positive card with reward and 
the negative card with punishment when 
on the habitual side, some are unable 
to generalize to the alternative side. 

This is the point Hilgard makes, and 
is the crucial point of this entire paper. 
To give a human analogy, similar to 
Hilgard’s (2, pp. 340-341), we might 
suppose that a gambler has in the past 
found he could do no better than chance 
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either with dice or with horse-racing. 
Suddenly he finds that the situation has 
changed, and that he can now predict 
winners at the race track. The ability 
to predict in this situation would be 
little incentive for him to stop betting 
on the horses in favor of returning to 
dice. It is, in fact, to his advantage 
to stay where he is. 

In many respects.this is the position 
of the fixated animals. If they continue 
jumping to their habitual window, they 
can successfully predict the occurrence 
of reward and punishment, while their 
experience during the insoluble problem 
has taught them that if they revert to 
the other window, they are unable to 
predict. 

This prediction may seem worthless, 
since the animal, like the gambler, has 
no control over the outcome. But the 
animal does have control over the 
amount of punishment it must endure 
from the locked window, just as the 


gambler can control the amount he 


stands to win or lose on a race. Be- 
cause th: animal has abortive jumps 
within its repertoire, it too is placing 
a bet. An abortive jump, in which 
the animal may jump so softly that 
it fails to reach the window, pro- 
tects it against punishment upon en- 
countering a locked window, but pre- 
vents it from gaining access through 
an unlocked window. If the animal 
should make an abortive jump at a time 
when the window is locked, it suffers 
less punishment. If it should make an 
abortive jump at a time when the win- 
dow is unlocked, it suffers more punish- 
ment and gains less reward. Conse- 
quently it is to the rat’s advantage to 
know when the window is locked. It 
can achieve this advantage only at the 
habitual window. There is no incentive 
to give this up. 

Moreover, the animal may make an 
interpretation of the pattern of reward 
and punishment entirely different from 
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that which a human would make, though 
perhaps not so different from that which 
man would have made in his early cul- 
tural development before magic was su- 
perseded by science. The fixated ani- 
mal may even learn that jumping at the 
negative card makes the positive card 
appear. 

In brief, then, the difference in the 
number of fixations occurring between 
control and experimental animals might 
be explained by the greater difficulty of 
one of the problems. 


2. The Non-Normal Distribution 


In an ordinary learning problem one 
expects the scores made by the total 
group to form a normal distribution, 
reflecting individual differences in the 
ability measured. But when giving con- 
trol animals the symbol-reward prob- 
lem, Maier finds that they split into two 
groups, those who learn the problem 
fairly rapidly (within 100 trials), and 
those who never learn it (6, p. 34). 
Maier interprets this to mean that the 
frustration which the rats endured 
caused some of them to switch from 
motivated to frustrated behavior. 

However, if a minimum amount of 
PAR abilities is necessary in order to 
solve the symbol-reward problem, and 
if some animals have less than this mini- 
mum, then the dichotomous distribution 
is explainable without the hypothesis of 
a motivational change. 

Moreover, there are individual differ- 
ences, not only in PAR abilities, but 
also in variability: From the time of 
initial training on the Lashley appara- 
tus, some animals resist the experiment- 
er’s efforts to get them to jump to their 
non-preferred window. One can expect 
a few animals who are too persistent in 
their habits to change when the situa- 
tion demands it. Since this persistency 
appears in the earliest part of their 
training, it is apparently innate and not 
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the result of any frustrating experiences 
the animals have encountered. 

Individual differences in the ability to 
generalize and to adapt may thus ac- 
count for the non-normal distribution 
which Maier obtains. 


3. The Differential Effect of Rewarding 
the Preferred and the Non-Preferred 
Window 

From the time the animal begins its 
training on the Lashley apparatus, it 
shows a preference for one of the two 
windows, often so strong that unless 
forced to do so the animal will never 
try an alternate window. When the ex- 
perimenter trains the control group to a 
position-reward response, he must dc- 
cide if he will train the animals to their 
preferred side or to the non-preferred. 

If he decides to train them to their 
preferred window, little or no training 
in discrimination is required. Both 
windows are left unlocked; the animal 
jumps to one; thereafter that window 
continues to be unlocked, and the ani- 
mal will usually jump there consist- 
ently. 

For the animals whose correct win- 
dow is opposite their preference, both 
windows are locked; the animal jumps 
to one, and thereafter that window con- 
tinues to be locked. Thus, these ani- 
mals are trained to the non-preferred 
window by making them experience 
their preferred window as 100 per cent 
punitive. 

Ordinarily we should expect that ani- 
mals who have been forced to jump to 
a non-preferred window would find it 
easier to surrender this in favor of a 
new. response than would animals who 
have been jumping with equal frequency 
to their preferred window. However, 
Maier has shown that 


“. , animals trained in position-reward 
responses that correspond to their natural 
preference are more able thereafter to ac- 
quire a symbol-reward response than ani- 


mals trained to a position-reward response 
opposed to their expressed natural prefer- 
ence” (6, p. 35). 


Maier explains this in terms of the 
greater frustration endured by the non- 
preference group. However, the signifi- 
cance of this greater frustration or pun- 
ishment may lie in its effect, not upon 
the motivational state of the animals, 
but upon their understanding of the sit- 
uation in which they find themselves. 

Since the preference animals never 
experience the alternate window as pu- 
nitive, when their habitual window be- 
gins punishing them each time the nega- 
tive card is in it, they naturally attempt 
to avoid that card by jumping elsewhere, 
and this can be only to the other win- 
dow. 

The non-preference group also finds 
the habitual window punishing them 
whenever the negative card is in it. 
If they stay where they are, they too 
will receive 50 per cent punishment. 
But they have previously learned that 
if they try the alternate window, their 
preferred window, they will fare even 
worse. Not knowing the mind of man, 
and unable from their rats’-eye view to 
perceive the total pattern of reward and 
punishment, they sec no reason for sup- 
posing that, when the habitual window 
changes from purely rewarding to fre- 
quently punishing, the other window 
changes from purely punishing to fre- 
quently rewarding. 

In other words, the difference be- 
tween preference and non-preference 
group may be that one has no cognitive 
map of the alternate window, so that 
it remains open for exploration, while 
the other has a very clear map of that 
window as leading only to punishment, 
and hence as something to be avoided. 


4. Comparison of Symbol-Reward and 
Position-Reward Problems 


In each experiment mentioned thus 
far, the animal with a position response 
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had to learn a symbol response. The 
habitual window is locked whenever the 
negative card is in it; so long as the rat 
retains its habitual response, it experi- 
ences 50 per cent orderly punishment. 
In other experiments, however, the ani- 
mal is given a new position response to 
learn. The habitual window is perma- 
nently locked, and the rat experiences 
100 per cent punishment so long as it 
keeps the old position habit. 

In comparing the behavior of animals 
on each of these problems, Maier notes 
that: 


“punishment on 100 per cent of the trials 
causes fewer animals to abandon a response 
than does punishment on 50 per cent of the 
trials... . [But] 100 per cent punishment 
makes . . . for more rapid learning than 
does 50 per cent punishment. 

“When these apparently contradictory 
results are combined we may say that 100 
per cent punishment makes either for fairly 
rapid response alteration or no alteration 
at all, whereas 50 per cent punishment 
makes for slower but surer learning” (6, 
pp. 37-38). 


There are two results to be explained 
here: 


1. More animals fixate in the 100 per 
cent punishment situation. 

2. When learning occurs at all, it oc- 
curs more rapidly in the 100 per cent 
punishment situation. 

Maier points out that the second re- 
sult is consistent with prevalent learn- 
ing theories, and applies to the animals 
who continue to be motivated. He im- 
plies that the first result is inccnsistent 
with learning theory, but consistent with 
frustration theory, and applies to the 
animals who change to frustrated be- 
havior. 

Both results, however, can be ex- 
plained without postulating a motiva- 
tional change: 

1. Why do fewer animals solve the 
106 per cent punishment problem? 
What is required of them? First, they 


429 


must perceive the change from 50 per 
cent to 100 per cent punishment. Since 
in the former situation they may ex- 
perience as many as three consecutive 
punished trials in a given day’s run of 
10 jumps, their perceptual problem is 
one of discriminating between three and 
more-than-three. Secondly, with respect 
to associative learning the animals are 
in an impossible situation, for they are 
given no opportunity to associate reward 
with the correct response; that is to say, 
they cannot learn that the alternative 
window will be rewarding until after 
they jump to it, so they never experience 
a pattern of reward and punishment. 
Since the rat cannot be expected to as- 
sociate or learn what it does not experi- 
ence, these animals are not in a learning 
situation at ali. They are being called 
on to perform, not learn. Maier him- 
self implies that these animals are not 
in a learning situation when he refers 
to their task in terms of “response al- 
teration,” and to the symbol-reward 
group in terms of “learning.” 

These symbol-reward animals, experi- 
encing 50 per cent punishment, are 
given the opportunity to associate the 
positive card with reward and the nega- 
tive card with punishment each time the 
card appears in the habitual window. 
They need make no alteration in their 
habitual mode of jumping in order to 
make this double association. Faced 
with a comparatively simple problem, 
more of them solve it. 

2. Why, then, are they slower in 
changing their behavior? It is because 
time is required, first, to perceive the 
subtle change from 50 per cent random 
punishment to 50 per cent orderly, and 
secondly, to build up the double asso- 
ciation. 

It is as though we were to ask one 
group of students to memorize nonsense 
syllables, and to ask a similar group to 
solve a problem in calculus, e.g., to dif- 
ferentiate x*. Since those comparatively 
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few students with the ability to solve 
calculus problems would quickly answer 
“2x,” while the majority without this 
ability would never solve the problem, 
this would make either for fairly rapid 
response or for none at ali. On the 
other hand, the first problem would 
make for slower but surer learning. 
Similarly, most of the symbol-reward 
group solve the problem, but require 
time, while those few position-response 
animals who solve their problem do so 
quickly. 
5. Guidance 


Maier discusses several ineffective at- 
tempts to break fixations, including the 
use of shock, vacation, and a variety of 
experiences on the Lashley apparatus. 
Since none of these procedures changes 
the nature of the problem confronting 
the animal, it is consistent with PAR 
theory that they were ineffective. He 
then describes the method of guidance, 
in which the animal is forced to jump to 
the correct window, and its effect upon 
fixations: 


“. . . 23 rats with position fixations could 
not execute a symbol-reward discrimination 
response in 200 trials but shifted to a per- 
fect discrimination after a few trials of 
guidance... . 

“Only 2 out of 28 guided animals .. . 
learned the symbol-reward discrimination 
response by guidance alone . . .” (6, pp. 
53-57). 


Again there are two separate points 
to be explained: 


1. How is guidance effective in break- 
ing fixations which are otherwise so per- 
sistent? 

2. Why is guidance usually ineffec- 
tive by itself in teaching a new re- 
sponse? 

Maier explains these two points by 
suggesting that guidance does not teach 
the animal anything, but simply tears 
it away from the fixated response (6, 
pp. 54-58). An alternative explan~tion 
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is that guidance enables the animal to 
experience what it cannot infer. 

1. Why does the fixated animal drop 
its fixation under the influence of guid- 
ance? The three basic assumptions 
mentioned at the beginning of this pa- 
per include four associations which the 
animal needs in order to understand the 
pattern of reward and punishment pre- 
vailing in the symbol-reward problem. 
According to PAR theory, the fixated 
animal builds up two of these associa- 
tions by means of direct experience, and 
fixation is the result of failure to infer 
the remaining two. Guidance makes 
it unnecessary for the animal to make 
one of these two remaining associations, 
namely, the association of positive card 
with reward at the non-habitual win- 
dow. Once the animal has learned 
through the direct experience of which 
guidance consists that it can avoid pun- 
ishment by jumping to the alternative 
window whenever the negative card is 
in the habitual window, it has sufficient 
understanding of the situation to break 
the fixation. The animal has not, how- 
ever, solved the symbol-reward problem. 

2. Why not? Why does the rat con- 
tinue to make errors? What more need 
it know? The answer, of course, is that 
it has not yet learned that the negative 
card will be punitive at the alternative 
window. The animal cannot learn this 
while jumping to the habitual window, 
nor can it learn this from the guided 
trials. Only trial and error can give 
this fourth and final association. Once 
the association is made, the animal can 
put the parts together into a completed 
4-piece jigsaw puzzle, to form a cogni- 
tive map of the terrain. 


6. The Effect of Duration of Punish- 
ment upon Fixations 


In each experiment referred to so far, 
amount of frustration and difficulty of 
the problem varied concomitantly, so 
that those animals who were forced 
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to endure more punishment were also 
given the more difficult problem to 
solve. In an experiment which was ob- 
viously not designed to test PAR the- 
ory, Maier and Feldman (5) held one 
factor constant for two groups and 
varied the other: 


“Litters were divided into three groups; 
the first group was subjected to the insol- 
uble problem for 8 days, the second for 16 
days, and the third for 24 days. ... As 
controls, three other groups, made up of 
divided litters, were trained by reward and 
punishment to form position responses. 
One group received 8 days of practice, the 
second 16 days of practice, and the third 
24 days of practice... . 

“After the position responses were estab- 


lished in both the experimental and the’ 


control groups, the problem was changed 
so that the animals were required to learn 
the symbol-reward discrimination problem. 
. . . Guidance and trial and error were 
alternated. [The 8, 16, and 24 day reward 
groups required 22.2, 20.2, and 19.8 trials 
(average), respectively, to abandon the po- 
sition response. ] 

“None of the differences in average 
scores is significant, but it is of -interest to 
note that increasing the number of days 
of training has not increased the strength 
of the position-reward response. [The 8, 
16, and 24 day randomly-punished groups 
required 30.1, 49.0, and 44.4 trials (aver- 
age), respectively, to abandon the position 
stereotypes. ] 

“Both the 16-day and the 24-day groups 
are [significantly] different from the 8-day 
group. . . . All groups are also significantly 
different from the groups trained in posi- 
tion rewards; .. .” (6, pp. 60-62). 


We can now make comparisons within 


control and experimental groups, as well 
as between them: 


1. The control groups all solved the 
problem with equal facility. Since these 
three groups had similar problems to 
solve, the result is in accordance with 
PAR theory. 

2. The experimental animals required 
more trials in which to solve the prob- 
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lem than did the control animals. Since 
this latter group had the easier problem, 
this too is in accordance with PAR the- 
ory. 
3. That there is no difference between 
the 16- and the 24-day experimental 
groups is what we would expect. But 
that these two groups would differ sig- 
nificantly from the 8-day animals is not 
what PAR theory would predict. In 
what way is the latter’s problem differ- 
ent from that of the other randomly- 
punished animals? 

First, with respect to the experience 
at the habitual window, this group ex- 
perienced less punishment than did the 
others, and should have beer: less rather 
than more rapid in giving up the win- 
dow; or, if frequency of punishment is 
irrelevant, the three groups should have 
been equal. 

Is there any way, however, in which 

the greater punishment endured at the 
habitual window by the 16- and 24-day 
animals might have made the problem 
a more difficult one for them? While it 
seems possible that the punishment had 
an emotional effect upon the animals, so 
that there was a loss in the ability to 
perceive, associate, or reason, evidence 
is against this: 
“ .. [In] tests made on the Maier three- 
table reasoning problem . . . 14 rats with 
position fixations made an average of 3.3 
incorrect responses in 12 tests... . Six 
nonfixated rats made an average of 3.2 
errors... . 

“Further, aa animal may learn a symbol- 
reward discrimination for walking and re- 
tain the position fixation for jumping” (6, 
pp. 47-48). 


That the fixated animals could both 
reason and learn satisfactorily suggests 
that the punishment suffered during the 
insoluble problem did not affect PAR 
abilities. The use of control animals 
suggests that the greater frequency of 
reward experienced by the 16- and 24- 
day groups is also irrelevant. 
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Apparently the briefer experience of 
the 8-day animals at the habitual win- 
dow sheds no light on their greater abil- 
ity to solve the symboi-reward problem. 
Let us turn, then, to the respective ex- 
periences of the animals at the non- 
habitual window. 

If the 16-day group had experienced 

this window as punitive more frequently 
than the 8-day animals, they would be 
more reluctant to jump to it. However, 
since animals in the insoluble problem 
seldom jump to the non-habitual win- 
‘ow after the Sth or 6th day, all three 
groups probably experienced this win- 
dow equally often. Moreover, the 8-day 
group had experienced the window as 
punitive only a few days before being 
presented with the symbol-reward prob- 
lem, while for the 24-day group almost 
20 days had elapsed. If the principle of 
recency is relevant here, the 8-day group 
should have shown greater reluctance 
to jumping at the non-habitual window 
than the others. 

Both on the basis of the lesser pun- 
ishment endured at the habitual win- 
dow, and on the basis of the more recent 
punishment endured at the non-habitual 
window, PAR theory would predict that 
the 8-day group would find it more dif- 
ficult than the other animals to sur- 
render their position response in favor 
of a symbol response. Since tke experi- 
mental results are directly opposed to 
this prediction, we are left with this 
question: Why does the strength of a 
position-stereotype response increase as 
the animal spends more time in the in- 
soluble problem, while the strength of 
a position-reward response does not in- 
crease with time spent in the reward 
situation? 


7. The Open-Window Situation 


One way of equating the difficulty of 
the problems facing ntrol and experi- 
mental groups is to increase the diffi- 
culty for the control animals, as in 


At EcLasH 


the Maier-Feldman study, if we use 
the 8-day punished animals as controls. 
Another way is to decrease the diffi- 
culty of the problem facing the experi- 
mental animals. This can be done by 
removing the card from the non-habitual 
window. Maier offers two photographs 
illustrating the behavior of a fixated ani- 
mal when confronted with an open win- 
dow on its non-habitual side: 


“we see the rat observing the food in the 
open window. The behavior of the animal 
on these occasions is one of sniffing and 
reaching out toward the open window. .. . 
Yet, invariably, the rat finally turns and 
jumps in conformity with its pesition fixa- 
tion . . .” (6, between pp. 114 and 115). 


The crucial point of this paper has 
been that the fixated animal does not 
know what awaits it at the non-habitual 
window. This is not true in the open- 
window situation. The card is removed, 
just as it was during preliminary train- 
ing, when the animal learned that with- 
out the card the window was always re- 
warding, never punishing. Food is in 
sight, well within the animal’s visual 
range, at the same distance, in fact, as 
the cards have been. The sniffing which 
the rat does shows that the food is also 
within olfactory range. 

The animal is thus given the answer 
to the problem we have been postulating 
it is unable to solve. It need make no 
inference now, nor do any more than 
follow its nose. Yet it fails to do this. 
Why? 

Perhaps the tremendous amount of 
punishment the animal has received has 
changed the interpretation it makes 
even of direct sensory stimuli. By first 
cajoling the rat into jumping, with the 
presentation of consistent reward, and 
then rigging up a punitive situation from 
which it could not escape, the experi- 
menter tricked the animal. The rat 
may now have developed a generally 
suspicious attitude toward the environ- 
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ment, seeing it as threatening despite 
any surface appearance. 

The thesis that the animal no longer 
perceives the same meaning in objects 
as it once did has evidence against it. 
On the three-table reasoning problem 
referred to above, the animal perceives 
as non-fixated animals do. Even on the 
Lashley apparatus, the animal responds 
to rewarding stimuli in the conventional 
manner, provided it can walk rather 
than jump. 

We are again left with an unanswered 
question: Why does the animal con- 
tinue jumping at its fixated window 
when there is appx: ently no longer any 
problem involving perception, associa- 
tion, or reasoning? 


CONCLUSION 


While much of the data which Maier 
offers as evidence for frustration theory 
can be otherwise explained, the two 
experimental results remaining unac- 
counted for are relevant to two ques- 
tions which any theory of fixation must 
answer: 

1. In general, why do fixations occur? 

2. In a given experiment, why do 
more fixations occur in one group than 
in another? 

Any theory of cognitive difficulty 
must answer that fixations occur be- 
cause the animal is unable to solve the 
problem, and that more fixations oc- 
cur in the group with the more difficult 
problem. The open-window phenome- 
non and the Maier-Feldman study re- 
fute, respectively, each of these ex- 
planations. When faced with an open 
window, the animal has no problem to 
solve, and no fixations should occur. In 
the Maier-Feldman experiment, the 16- 
day randomly-punished animals faced 
the same problem as did the 8-day 
group, and an equal number of fixations 
should have occurred. 

It seems unlikely that some future 
application will reconcile these results 
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with what a theory of cognitive diffi- 
culty would predict. As a matter of 
fact, it may have seemed that liberties 
have been taken in order to make a 
theory fit the data. This was necessary 
if we were to make sure that fixations 
could not be explained in cognitive 
terms. Before accepting new concepts 
—and I think both anxiety-reduction 
and frustration-instigated behavior are 
new concepts—it is worthwhile to make 
sure the old ones are inadequate for the 
purpose. In order to do this, it may be 
necessary to put old concepts, like PAR, 
to new uses. 

If after granting our cognitive theory 
every assumption, we find it inadequate, 
where do we go next? 

In the introduction the various the- 
ories were described as forming a dou- 
ble-Y maze, At the first fork, we took 
the cognitive alley, postulating no moti- 
vational change in the animal, and 
found it promising, but a cul-de-sac. 
It behooves us now, as it would any 
intelligent animal, to retrace our steps 
and to try the conative alley, postulat- 
ing a motivational change. 

It too contains two branches, frus- 
tration and anxiety-re<iuction. We ex- 
plored the cognitive alley first because 
it seemed the more conservative of the 
two. By the same criterion the anxiety- 
reduction alley should be explored next. 
Only if postulating that the animal ex- 
changes one goal for another leads to 
a blank wall, can we seriously consider 
the possibility that the animal exchanges 
his goal for no goal at all, that the fix- 
ated response is “behavior without a 
goal.” 

SUMMARY 


By making certain assumptions about 
the application of the principles of 
perception, association, and reasoning 
(PAR) to the experimental design used 
by Maier in producing animal fixations, 
we were able to account for much of his 











data, but found two results which we 
were unable to explain: 


1. The strength of a position-stereo- 
type response increases as the animal 
spends more time in the insoluble prob- 
lem. 

2. If the nou-habitual window is 
opened, exposing the food, a fixated ani- 
mal continues to jump at the habitual 
window. 

It seems unlikely that PAR can ac- 
count for these two phenomena, since 
cognitive difficulty for control and ex- 
perimental groups is equated. In order 
to explain them satisfactorily, we must 
concern ourselves with the motivational 
factor. 

The two current theories which take 
this factor into account in attempting to 
explain fixations are anxiety-reduction 
theory, in which the new motive be- 
comes one of reducing anxiety, and 
frustration theory, in which there is 
no motive for the behavior. Logically, 
anxiety-reduction theory should be ex- 
plored next. Only if this proves to be 
a blind alley should we seriously con- 
sider the possibility that the fixations 
are “behavior without a goal.” 
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CAN THE ANXIETY-REDUCTION THEORY EXPLAIN 
ABNORMAL FIXATIONS? 


BY NORMAN R. F. MAIER AND PAUL ELLEN 
University of Michigan 


In a recent review of Maier’s book on 
Frustration, Hilgard (2) pointed out 
that the theory of anxiety-reduction, as 
primarily developed by Mowrer, Miller, 
and others (1, 10, 12, 13, 14, and 15), 
was not adequately treated as an alter- 
native to the theory set forth. McClel- 
land (11) has made a similar observa- 
tion, but goes so far as to state that 
the anxiety-reduction theory would 
adequately explain the results. In- 


terestingly enough, Mowrer’s (16) own 
review does not suggest that the anxi- 
ety-reduction concept has been slighted. 

Since the most relevant investigations 
on anxiety-reduction were published 
after Maier’s book was in press, it was 
not possible to include them in the 


discussion. Those that were pub- 
lished seemed, at the time, not to be 
relevant both because frustration was 
not stressed, and because the writers 
on anxiety-reduction made no sugges- 
tion that there was a relationship be- 
tween their work and the work of Maier 
and his students. However, some of 
the recent publications on anxiety-re- 
duction seem to be relevant and justify 
Hilgard’s and McClelland’s observation 
that similar results are being explained 
by different concepts. Whether or not 
the writers on anxiety-reduction would 
agree to this observation is difficult to 
say. Certainly the exponents of the 
theory have made no claim that their 
theory can explain the basic facts of 
abnormal fixations. Nevertheless, in 
the light of opinions expressed by Hil- 
gard and McClelland, and our own feel- 
ing that the recently published work on 
anxiety-reduction is relevant, we feel 
justified in supplementing the frustra- 
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tion theory with a consideration of the 
evidence leading to the anxiety-reduc- 
tion theory. 

The anxiety-reduction theory is of 
particular importance because it is con- 
sistent with the view that abnormal be- 
havior can be reduced to learning con- 
cepts, and, at the same time, postulates 
a need which formerly either was not 
stressed or was entirely overlooked. 
Anxiety-reduction represents a need that 
the organism might develop in a stress- 
ful situation and it might be quite un- 
related to the need that the experi- 
menter may have regarded as the doimi- 
nant one. Thus, an experimenter might 
design his experiment on the basis of a 
primary need and the animal may be 
behaving on the basis of a secondary 
need. This difference between the ex- 
perimenter’s point of view and that of 
the animal might cause the experimenter 
to fail to understand the animal’s be- 
havior. Could this account for the sen- 
ior author’s definition of frustration as 
“behavior without a goal”? 

In order to make a case for frustra- 
tion theory as distinct from motivation 
theory we must examine the relevant 
and specific evidence. It is the purpose 
of this paper first, to examine the ex- 
perimental evidence on which the anxi- 
ety-reduction theory is based and deter- 
mine whether frustration theory can 
adequately account for it. Second, we 
wish to examine the evidence on which 
the frustration theory is based and see 
whether the concepts of anxiety-reduc- 
tion theory can explain the findings. It 
is hoped that this process will clarify 
the differences and permit the design of 
critical experiments. We believe that 
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the differences in views do not stem 
from a disagreement over definitions of 
motivation or of learning, but rather 
they hinge on whether or not a quali- 
tative change occurs when an organism 
undergoes frustration. Thus we believe 
that the issue to be determined is basic 
rather than semantic. It seems that if 
a qualitative distinction exists, it should 
be recognized and faced so that the 
benefits of the change in theory can 
serve their purpose in research design. 
If a qualitative break does not exist 
then prevalent theories should be re- 
fined so that the apparent discontinuity 
is explained. When Newcomb (18) 
distinguishes between threat-oriented 
and goal-oriented responses, he is rec- 
ognizing a qualitative factor, but the 
common term “oriented” tends to ob- 
scure the break in continuity and give 
the appearance of two ends of a con- 
tinuum. 

A casual consideration of the experi- 
ments which led to the formulation of 
the frustration-fixation hypothesis sug- 
gests that they merely represent an ex- 
tension of the frustration-aggression hy- 
pothesis and of the frustration-regression 
hypothesis. Since, however, the expla- 
nation of fixated behavior required a 
qualitative break with learning under 
motivation, a further condition is im- 
plied. This is that frustration not only 
leads to fixation, but that frustration 
results in behavior that has no goal ori- 
entation. If this is true, then other 
behaviors associated with frustration 
should also lack goal orientation. Maier 
(4), therefore, proceeded to fit the be- 
haviors of aggression and regression into 
his frustration mechanism and take them 
out of the realm of motivation. Thus 
the theory states that when an individ- 
ual is placed in a problem situation, he 
first shows problem-solving vior 
which is characterized by vatiability 
and goal seeking. When, however, the 
problem is insoluble, when escape is pre- 
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vented and when pressures force the 
animal into the problem, he will become 
frustrated at one stage or another. The 
appearance of frustration is character- 
ized by a qualitative change in behavior. 
Problem-solving behavior now is re- 
placed by aggression, regression and fix- 
ation and these may appear separately 
or in various combinations. None of 
these behaviors is oriented toward the 
initial goal; in fact they most frequently 
separate the individual farther from the 
initial goal. Thus, when frustration oc- 
curs, the behavior expressed is an end 
rather than a means to an end. 

The manner in which frustration the- 
ory deals with aggression, regression, 
anxiety, neurosis, and compulsive be- 
havior has been discussed in detail else- 
where (4) and need not concern us here. 
The question at issue is whether a quali- 
tative distinction must be made. [If so, 
the implications are great. It means 
that learning theory must exclude frus- 
trated behavior; that social psychology 
must distinguish between groups organ- 
ized around frustration and groups or- 
ganized around common goals (5); that 
attitudes may be of two types and have 
two different origins; that therapy must 
recognize and treat two conditions of 
the organism; and that all behavior 
change, training or correction, must be 
adapted to the condition of the organ- 
ism. 

If fixated behavior can be reduced to 
motivation, no such implications are in- 
volved. It is because the implications 
of the two approaches are so great that 
the evidence must be carefully exam- 
ined and, where the facts are unclear, 
crucial experiments must be made. 


APPLICATION OF FRUSTRATION THEORY 
TO Data SUPPORTING THE ANXIETY- 
REDUCTION HYPOTHESIS 


Mowrer (13) is of the belief that or- 
ganisms can be motivated either by di- 
rectly felt organic needs or by “the mere 
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anticipation of such pressures and that 
those habits tend to be acquired and 
perpetuated (reinforced) which effect a 
reduction in either of these two types of 
motivation” (13, p. 561). He believes 
that although much of human behavior 
cannot be prompted by organic drives, 
it may be explained by “assuming 1. 
that anxiety, i.e., mere anticipation of 
actual organic need or injury, may ef- 
fectively motivate human beings, and 
2. that reduction of anxiety may serve 
powerfully to reinforce behavior that 
brings about such a state of relief or 
‘security’ ” (13, p. 564). Considerable 
experimental evidence has been collected 
by various investigators which seems to 
support the view that fear or anxiety 
(using the terms synonymously) can be 
used to motivate the learning or per- 
formance of new responses in exactly 
the same way as hunger, ihirst, or sex 
(10, 12, 14). 

Miller (12), using a two-compart- 
ment apparatus, one white, with a grid 
as the floor, and the other black, with 
a smooth solid floor, tested the hypothe- 
sis that fear can be learned and can 
serve as a drive to motivate behavior. 
The two compartments in the box were 
separated by a door which could be 
opened in several ways: by the experi- 
menter pressing a button, by the rat 
pressing a bar, or by the rat moving a 
wheel. Preliminary training showed 
that the cues in both the white and 
black compartments were relatively neu- 
tral; that is to say, no preference was 
expressed for either compartment. Then 
the animals were placed in the white 
compartment, and given electric shocks. 
The rats could escape from the compart- 
ment by moving the wheel which would 
open the door between the two compart- 
ments. Ten trials were allowed for the 
learning of the escape response. Shock 
was then discontinued and pressing the 
bar instead of moving the wheel now led 
to escape. On the first non-shock trial 
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the rats exhibited fear responses such as 
urination, biting, and pawing in the vi- 
cinity of the door. On subsequent non- 
shock trials most of the animals showed 
gradual elimination of the extraneous 
movements, and when placed in the box 
moved the wheel rapidly and went into 
the black compartment. This test dem- 
onstrated that the previously neutral 
cues in the white compartment had ac- 
quired the tendency to motivate trial- 
and-error behavior and the learning of 
a new response in exactly the same way 
as do hunger, thirst, etc. (12). Because 
of the animal’s previous training with 
the shock and the behavior exhibited, 
it was postulated that the acquired drive 
was fear, and the escape from the com- 
partment with its consequent reduction 
of fear produced a rewarding effect. 
The results clearly support this postu- 
late. 

Because frustration theory does not 
exclude the learning of avoidance re- 
sponses, this experiment and others of 
similar nature (10, 14) are not ques- 
tioned, except insofar as they may be 
generalized to interpret all experiments 
which use punishment as a training de- 
vice. Since Miller has not attempted 
to generalize his interpretation to in- 
clude abnormal fixations, his findings 
and interpretations are not in conflict 
with this aspect of the theoretical issue. 
In other words, we do not reject the 
possibility of the motivated learning 
of avoidance and approach reactions. 
Where we would differ is in the inter- 
pretation of results when punishment 
is used to an extent where frusttation 
is produced in some or in all of the ani- 
mals. It is for this reason that we 
differ with Farber (1) who attempts to 
generalize the concepts of Mowrer and 
Miller to abnormal fixations. 

Farber (1), applying the principle of 
motivation and learning through anxi- 
ety-reduction, believes that behavior fix- 
ation can adequately be explained in 
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these terms. Essentially his hypothe- 
sis is that the fixation of non-adaptive 
responses in shock situations results 
from uncontrolled secondary reinforce- 
ment which is in the form of anxiety- 
reduction. Since fixation could occur 
under shock conditions, any diminution 
in the strength of the fixation which was 
due to feeding at the locus of shock 
could be ascribed either to direct inter- 
ference with the on-going response, or 
with some process maintaining it. 

To test this hypothesis, four groups 
of rats were given 100 trials in a single 
unit T-maze with food reward in the 
goal box on their preferred side. Dur- 
ing the last 60 trials, two of the four 
groups were shocked immediately after 
the choice-point in the maze during their 
runs to the food. After the completion 
cf this training, one of the shocked 
groups and one of the non-shocked 
groups was fed for two 10-minute pe- 
riods at the place where the shock was 
administered to the shocked groups. 
On the next day, the food was placed 
in the goal-box on the non-preferred 
side for all animals and the shock was 
discontinued. The animals were now 
tested with the position of the food 
changed until the original responses ex- 
tinguished. The number of trials to 
extinction for all four groups was com- 
pared. 

The major findings are shown in Ta- 
ble 1. It will be seen from this table 
that the responses of the shocked rats 
showed greater resistance to extinction 
(61.12 trials) than did those of the non- 
shocked rats (9.71 trials). This find- 
ing, in addition to conforming to Far- 
ber’s hypothesis, is also amenable to 
explanation in terms of frustration the- 
ory. Frustration theory would predict 
that some rats would be frustrated by 
the shock, and this frustration would 
tend to fixate the response. Rats not 
frustrated by the shock would behave 
as motivated animals. Further, it is 
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to be seen from Table 1 that the re- 
sponses of the shocked-non-fed rats 
showed greater resistance to extinction 
(61.12 trials) than did those of the 
shocked-fed group (15.04 trials). Again 
Farber’s hypothesis is supported, but 
this finding also conforms to frustra- 
tion theory since it would be predicted 
that feeding would alleviate the frus- 
tration caused by the shock, and hence 
fewer fixations would result. Although 
the difference between the non-shock 
and the shock-fed groups is not signifi- 
cant, the trend of results would favor 
the position assumed under frustration 


TABLE 1* 


No. or TRIALS TO EXTINCTION FOR ALL S’s 
in Four Sus-Grovups 











Non- 
Shocked — a. shocked- 

il 17 8 10 

14 15 17 6 

12 3 7 12 

8 7 17 8 

57 7 5 10 

41 10 8 7 

48 4+ 7 5 

19 10 6 7 

25 5 9 13 

63 26 6 19 

14 16 8 6 

24 21 10 8 

27 4 11 9 

33 4 8 4 

255t 19 5 7 

263t 3 15 2 

15 6 6 3 

57 33 17 5 

33 3 17 9 

163 98 11 6 

36 10 11 17 

193 3 8 28 

10 10 7 17 

46 27 11 10 

Mdn. 33 10 8 8 
Mean 61.12 15.04 9.71 9.50 
S.D. 74.09 19.24 3.83 5.75 














* Table reproduced from Farber (1, p. 122). 
t These animals refused to extinguish at all 
under the ordinary extinction procedure. 
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theory, since it is very likely that feed- 
ing may not remove frustration com- 
pletely. 

The results thus far examined appear 
to be adequately accounted for under 
both frustration theory and Farber’s 
anxiety-reduction formulation. How- 
ever, closer examination of Farber’s 
data reveals certain essential details 
which have not been accounted for by 
an anxiety-reduction theory, and which 
appear to be adequately explained by 
frustration theory. Thus, frustration 
theory would expect that some rats 
would not be frustrated by the shock, 
and would extinguish as non-shock rats; 
whereas others would be frustrated and 
would show high extinction scores. Ta- 
ble 2 presents a frequency distribution 
of the extinction scores of Farber’s ani- 
mals. It will be noted from this table 
that 11 rats in the shocked group 
showed scores which fell within the 
range (1-30 trials) df the non-shocked- 
fed group. It is also to be noted that 
4 animals in the shocked group have 
scores clearly falling outside this range, 
their scores being 163, 193, 255, and 
263. Even if we discount the fact that 
the last two animals stil] had not ex- 
tinguished in the trials given we find 
that the best of these animals deviates 
26 S.D. units from the mean score of 
the non-shocked-fed group. It is ob- 
vious that they, at least, form a sepa- 
rate population even if we disregard 9 
other animals which exceed the range 
of the non-shocked-fed group. In a 
similar manner, we may note that 22 
rats in the shocked-fed group extin- 
guished as non-shocked-fed animals, 
whereas, only two animals in that group 
exceeded the highest score for the non- 
shocked-fed rats. The score of the 
animal which falls within the interval 
31-35 may be considered doubtful as 
to whether it is outside the normal range 
of scores, but the score which falls 
within the interval 96-100 is clearly be- 
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TABLE 2 


FREQUENCY DISTRIBUTION OF EXTINCTION 
Scores oF FarBer’s ANIMALS 





Non- 
” Ishocked 





a i- 





Cmcoccoocoocoos | He wee rf 


SOSSCSSSHKNONRK KN! HK HK UNO 
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4 (low-163) 





yond the normal range of scores. If 
we eliminate the four animals in the 
shocked group and the one animal in 
the shocked-fed group which clearly de- 
viate from:the normal distribution of 
scores (non-shocked-fed rats), we find 
that the mean of the shocked group 
would be 29.75 instead of 61.12, with a 
S.D. of 17.5 instead of 74.04; and the 
mean of the shocked-fed group would 
be 10.3 instead of 15.04, with a S.D. of 
9.2 instead of 19.24. It is obvious that 
these five animals contributed most of 
the variance within their respective 
groups, thereby accounting for the ma- 
jor differences between the shocked- 
group and the non-shocked-fed group. 
It is these animals which clearly sepa- 
rate themselves from the rest of the 
population and whose behavior is clari- 
fied by frustration theory. The anxiety- 
reduction theory does not postulate a 
break or split in the population, and 
this may explain why Farber failed to 
appreciate the unusual nature of his 
distribution. 

Let us now examine the results of 
two other studies and see whether the 
data can adequately be interpreted in 
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terms of frustration theory. Although 
the authors do not generalize their in- 
terpretations to include abnormal fixa- 
tions, such as we have described, we 
believe that frustration theory can be 
used to advantage in understanding the 
behavior obtained. 

Mowrer (15) describes an experiment 
in which rats were trained to escape 
from a compartment which had a 
charged floor. When the floor of the 
escape compartment was also charged, 
it was found that the rats persisted in 
their previous “escape” behavior even 
though it was now unadaptive. Mowrer 
believes that such “crazy behavior” ne- 
cessitates the postulation of two basic 
learning mechanisms; problem-solving 


and conditioning. By problem-solving, 
overt actions or habits are acquired; 
these reduce drives, solve problems. By 
conditioning, the emotions or second- 
ary drives are acquired. The so-called 
“crazy behavior,” therefore, is inter- 
preted as follows: 


“ . . fear of. the apparatus is reinforced 
every time the rat gets shocked therein; 
and the habit of running in response to 
this fear is reinforced by the fact that the 
running, though resulting in a momentary 
exposure to the shock, leads to a relatively 
enduring reduction in the sonengety drive 
of fear” (15, p. 593). 


It appears that Mowrer has recog- 
nized the fact that he is dealing with 
two distinctly different kinds of be- 
havior, and that a modification in a 
learning theory which postulates need- 
reduction is necessary. However, the 
attempted modification implies some 
rather tenuous assumptions concerning 
the operation of secondary rewards. 
According to frustration theory, one 
would interpret the original response 
which removes the animal from the 
charged compartment as motivated 
learning. When, however, shock is used 
to punish the escape response, the ani- 
mal becomes frustrated, and as a con- 
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sequence, fixates the response in prog- 
ress. It is this fixation that supplies 
the persistent and unadaptive charac- 
teristics to the behaviot. There is a 
change, therefore, from a motivated 
instrumental act to a frustration-insti- 
gated act because frustration has been 
induced. It appears then that the ap- 
parently “crazy behavior” which Mow- 
rer describes can adequately be inter- 
preted under the assumptions implied 
in a frustration theory. 

In a similar manner, we may examine 
another experiment by Mowrer and 
Viek (17). Using an inhibitory index 
of fear, these authors compared the 
number of response inhibitions between 
two groups of rats placed in an escape 
situation. Essentially, the procedure 
employed was as follows: On each trial 
hungry rats were offered food on a stick 
for 10 seconds. Regardless of whether 
the animal ate, shock was applied 10 
seconds later. For one group of 10 
rats, the correct response to the shock 
was jumping into the air as a means 
of turning off the shock (shock-control 
group). For a second group of 10 rats, 
shock was applied for the same duration 
that it took the shock-control animals 
to turn off the shock, but for this group, 
no specific response would terminate the 
shock. This second group will be desig- 
nated as the shock-uncontrolled group. 
One trial a day was given to each ani- 
mal. The authors reasoned that for 
the first group (shock-control group), 
“the anticipatory tendency to jump 
should bring forward with it some of 
the relaxation which ordinarily follows 
the termination of shock.” Therefore, 
these animals should be less frightened 
and should show more tendency to eat 
than the animals in the shock-uncon- 
trolled group for which this anticipatory 
response could not be acquired. As 
seen in Table 3, this prediction is borne 
out in that only 16 eating inhibitions 
occurred in 15 days in the shock-control 








ANXIETY-REDUCTION THEORY AND ABNORMAL FIXATIONS 


TABLE 3* 


No. oF RESPONSE INHIBITIONS OBTAINED IN SHOCK-CoNTROL GROUP AND 
SHocK-UNCONTROLLED GROUP 








Successive days 3};4/5 


6 7;819 





Shock-control group 110;}2 


2,;2}3)1 





Shock-uncontrolled 
group 0;1/4;)5147 




















6/7) 4 



































* Cited from O. H. Mowrer and P. Viek (17, p. 196). 


group as compared to 85 in the shock- 
uncontrolled group. 

However, it would seem that if fear- 
reduction is learned then the shock- 
uncontrolled group should also learn 
some method for reducing fear. If their 
situation does not supply an easily 
learned response, they should merely 
differ from the first group in that their 
fear-reducing response would be de- 
layed. Thus the shock-uncontrolled 
group should show a tendency to re- 
duce its inhibitions in successive days. 
Comparing the first seven with the last 
seven days, we find a total of 30 inhibi- 
tions for the first seven days and a to- 
tal of 51 inhibitions for the last seven 
days, a trend which is just opposite to 
what should be expected. 

Frustration theory would predict that 
eating inhibitions would be more com- 
mon in frustrated than in non-frustrated 
rats. Since the animals in an insoluble 
problem are the most likely to become 
frustrated, we would predict more rats 
showing eating inhibitions in the shock- 
uncontrolled group than in the shock- 
control group. This corresponds to the 
findings of Table 3. Further, we would 
predict that animals would not be frus- 
trated until they experienced a certain 
amount of failure, and that all would 
not become frustrated at the same time. 
We should expect, therefore, that at 
the outset, the number of rats showing 
eating inhibitions would be low and 
that they would rise as long as the 


number of frustrated rats increased. 
Examination of Table 3 shows that for 
the first two days the animals in the 
sheck-uncontrolled group (the insolu- 
ble problem) behaved very much like 
the shock-control group in that rats 
showing eating inhibitions were almest 
entirely absent. Thereafter inhibition 
became more frequent, and by the fifth 
day the number of rats showing eating 
inhibitions in the shock-uncontrolled 
group reached seven. On one occasion 
only thereafter did the cases of eating 
inhibition fall below the nursber found 
during one of the first four days. No 
such trend in the data is found in the 
shock-control group for which the prob- 
lem was soluble. Rather it appears that 
this group finally approached mastery 
of the problem. 

We feel, therefore, that in this in- 
stance too, frustration theory can ade- 
quately account for the major results 
and it also can explain additional de- 
tails in the data. 


APPLICATION OF ANXIETY-REDUCTION 
CoNCEPTs TO ABNORMAL FIxa- 
TION EXPERIMENTS 

Having examined and reinterpreted 
some of the experimental evidence put 
forward by the anxiety-reduction the- 
orists, let us examine some of the data 
on which the abnormal fixation principle 
is based and thereby test the adequacy 
of the anxiety-reduction theory. 





NorMAN R. F. Marer AnD Paut ELLEN 


Using anxiety-reduction and learning 
concepts, it would be necessary to as- 
sume: (1) that a frustrating situation, 
i.e., forcing the animal to respond to a 
problem it is incapable of solving, causes 
anxiety; (2) the animal’s aim becomes 
one of reducing the anxiety, rather than 
seeking the food reward; and (3) any 
respouse that allows relief from the anx- 
iety in the situation will be adopted. 
Using these assumptions, let us attempt 
to predict the behavior that results when 
a rat is placed in an insoluble problem 
situation. « 

At first, as would be expected in any 
trial-and-error learning problem, ani- 
mals would tend to show considerable 
variability in behavior. If the prob- 
lem is insoluble, however, or if it is too 
difficuit, the animal will refuse to jump 
after several failures. The application 
of an air-blast or electric shock would 
then be necessary to evoke a response. 
It is possible then, that the forcing 


agent may serve as an unconditioned 
stimulus for behavior elicited; and the 
total experimental situation may serve 
as a conditioned stimulus to which con- 
ditioned fear and other responses may 
become associated. There then might 
be a shift in the motivational aspects 


of the situation. That is to say, the 
response that is made under such con- 
ditions would not continue to be di- 
rected toward obtaining food, but rather 
would change to one of escaping from 
the experimental situation. The re- 
sponse that occurs as a result of the 
application of the electric shock or air- 
blast (jumping away from the platform 
and toward the screen) might be pre- 
dicted as being the one which will be 
learned or fixated, because it is the 
first response which removes the ani- 
mal from the situation. When the 
problem is changed so that a solution 
is possible, it could be predicted that 
the response developed in the no-solu- 
tion situation would continue. The 


basis for this latter prediction could 
reside in the fact that the animal never 
tries or is motivated to try the alterna- 
tive response, since the on-going re- 
sponse is one which satisfactorily re- 
duces the anxiety in the situation. Thus 
the inability of a rat to learn a discrim- 
ination response after having been sub- 
jected to an insoluble problem would 
be ascribed to the absence of a motivat- 
ing condition which would call forth 
alternative behavior. Phrasing it an- 
other way, it might be said that the 
animal has no incentive to try an al- 
ternative, since the response that is 
being practiced reduces the conditioned 
anxiety. In this way, then, an abnor- 
mal fixation would be considered as a 
learned adjustive mechanism to a try- 
‘ing situation, rather than the result of 
a separate frustration mechanism. This 
formulation would reduce the abnormal 
fixation to a very strong habit which 
is acquired in accordance with current 
learning principles. 

On the surface this hypothesis ap- 
pears to provide a satisfactory explana- 
tion for abnormal fixations, and also 
tends to fit common-sense impressions 
and practices. However, there are cer- 
tain other experimental findings which 
appear to be inexplicable in terms of 
an anxiety-reduction theory of fixation. 


1. After animals have developed posi- 
tion fixations in the insoluble problem, 
and are confronted with a soluble dis- 
crimination problem, they learn which 
card punishes and which card rewards 
(expressed by the manner of jumping), 
but instead of practicing this learning, 
they continue to execute the fixated re- 
sponse (7). According to the anxiety- 
reduction theory, it would seem reason- 
able to expect that an animal should 
practice the discrimination which it has 
learned and which offers more anxiety 
reduction, rather than continue practic- 
ing the fixated response. A response 
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which gives the animal escape from the 
stand and no punishment from the card 
should reduce anxiety more than a re- 
sponse which gives escape and punish- 
ment for jumping 50 per cent of the 
time. 

2. It has been found that in replac- 
ing one response with another, 100 per 
cent punishment either causes a very 
rapid learning of the new response, or 
causes fixation of the initial response, 
whereas 50 per cent punishment results 
in slower learning of the new response 
for those animals that learn, but the 
incidence of fixation of the initial re- 
sponse is less frequent (8). No such 
difference should occur if we assume 
that the animal is merely attempting 
to escape from the jumping platform, 
since the escape from shock factor is 
constant in the two punishment condi- 
tions. If, however, the anxiety-reduc- 


tion theory included fear of the punish- 
ment which occurs as a consequence of 
jumping incorrectly (striking the locked 


window, etc.), it would demand the 
more rapid learning of the new response 
with 100 per cent punishment for the 
old response than with 50 per cent pun- 
ishment, because the anxiety-reduction 
would be greater with 100 per cent pun- 
ishment than with 50 per cent punish- 
ment. But then the greater number of 
fixations that are obtained with 100 per 
cent punishment than with 50 per cent 
punishment would not be explained. 

It is also possible that one might at- 
tempt to explain the different effects of 
100 versus 50 per cent punishment on 
the basis of some probability theory. 
Humphreys (3) found that responses 
learned under 50 per cent reward were 
less easily extinguished than responses 
learned under 100 per cent reward, and 
he effectively explained his results in 
terms of expectancies of reinforcement. 
One might conceivably use a similar 
approach and explain why 50 per cent 
punishment is more disrupting to an old 
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response than 100 per cent punishment. 
However, if this approach were used, 
only the difference in the number of 
fixations produced by these two pat- 


- terns of punishment would be explained, 


but the fact that 100 per cent punish- 
meni is superior to 50 per cent punish- 
ment in that it produces more rapid 
learning of a new response would be 
unexplained. This dual effect of pun- 
ishment thus poses a serious difficulty 
for both a reinforcement and a proba- 
bility theory. 

3. When animals are required to 
modify the responses developed in an 
insoluble problem situation, a bimodal 
distribution of scores obtains; namely, 
those rats which can solve the new prob- 
lem and those which cannot (7). If 
all that is involved is a single motiva- 
tion mechanism, should not all ani- 
mals learn an anxiety-reducing response 
or some other secondary reward and 
merely differ in degree? The fact that 
some animals learn a new response at 
a normal rate and others do not change 
at all, presents an important problem 
for any present motivation theory of 
fixation. 

4. It has been observed that a frus- 
trated rat will persist in jumping to its 
fixated side even when an open window, 
in which food is visible, is presented on 
the other side (4). If the animal’s be- 
havior is designed to escape from the 
situation, it would seem reasonable that 
it should escape via the open window. 
The apparent compulsiveness of the fix- 
ated response is an important problem 
which is not explained by the new pos- 
tulates developed by anxiety-reduction 
theorists. 

5. It has also been observed that a 
rat will fixate a response in progress 
during a frustrating experience rather 
than exhibit some trial-and-error be- 
havior which is characteristic of other 
avoidance learning problems (7). An 
anxiety-reduction theory, like all other 
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learning theories, would predict that 
when an animal is faced with an insol- 
uble problem situation, the behavior at 
first should be quite variable. The fact 
that variability is present in some prob- 
lem situations and not in others is a 
fact which must be explained if no dis- 
tinction is made between frustrated and 
motivated behavior. 

6. When a situation is changed so as 
to require a new response, it was found 
that guidance causes animals to give up 
their old responses before they have 
learned the new response, whereas trial- 
and-error causes animals to give up an 
old response only after the new response 
is learned (9). The fact that guidance 
is primarily effective for breaking an 
old response and is not conducive to 
the learning of a new response, whereas 
trial-and-error has the opposite effects, 
is difficult to reconcile with any theory 
that makes learning a function of rela- 
tive excitation values. In terms of 
strength of response a change should 
occur only when an alternative develops 
greater strength than a former response. 

7. Finally, it has been shown that 
the behavior of frustrated rats is more 
rigid than is found in other cases of 
learning (6,9). If frustrated behavior 
is merely a form of learning, based on 
secondary reward, should not such be- 
havior respond to the same methods of 
response alteration which are effective 
for breaking habits based on primary 
rewards? If a distinction is not to be 
made, one would seemingly have to pos- 
tulate that secondary rewards are more 
basic and form stronger associations 
than do primary rewards. 

The above observations represent the 
major findings which are basic to frus- 
tration theory, and represent the type 
of evidence which was responsible for 
its formulation. It is because the find- 
ings did not iend themselves to expla- 
nations in terms of learning concepts 
and often went contrary to well-estab- 


lished principles, that a new direction 
in thinking seemed necessary. Frus- 
tration theory thus represents an at- 
tempt at a solution to the “neurotic 
paradox” (15). 


SUMMARY AND CONCLUSIONS 


We have presented the evidence in 
favor of both an anxiety-reduction the- 
ory and a frustration theory. We have 
also attempted to explain the results 
obtained by anxiety-reduction theorists 
in terms of frustration theory, and have 
found that the results are entirely con- 
sistent with frustration theory. Fur- 
ther, we have found that frustration 
theory can explain details in the re- 
sults of the experiments designed to 
test the anxiety-reduction theory and 
have found that these details are con- 
sistent with frustration theory, but the 
details have not been explained by the 
anxiety-reduction theory. Finally, we 
have attempted to apply the concepts 
of an anxiety-reduction theory to the 
experimental findings on which frustra- 
tion theory is based, and have found 
them inadequate. Future authors may 
be able to encompass the problems we 
have raised, but we believe this will re- 
quire radical alterations in learning the- 
ory. We believe that these problems 
can be explained more parsimoniously 
on the basis of frustration theory. This 
theory does not require the discarding 
of basic learning concepts, and at the 
same time permits an extension of be- 
havior analysis by developing a new set 
of concepts which operate when the or- 
ganism is in a state of frustration. 

In separating motivation and frustra- 
tion into two separate mechanisms, we 
have the basis for explaining contradic- 
tory behavior trends without having to 
reconcile them. The implications of 
making such a distinction are very wide- 
spread, and invade practically every 
field of psychology. If these implica- 
tions are to be aveided, some adequate 
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way for accounting for the qualitative 
discontinuity in present experimental 
findings must be found. It appears 


that the crucial experiments must cen- 
ter on the problem of qualitative behav- 
ior changes, and it is hoped that the 
present analysis will point up this need. 
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THE AMOUNT .OF INFORMATION IN 


ABSOLUTE JUDGMENTS * 


BY W. R. GARNER AND HAROLD W. HAKE 
The Johns Hopkins University 


By absolute judgment we mean that 
type of judgment in which an O of 
a particular stimulus identifies that 
stimulus with a single name or num- 
ber. He does not make a judgment 
of whether a stimulus is greater than, 
less than or the same as another. He 
states that a stimulus has a particular 
value, whether the value is some physi- 
cal measure, ¢.g., 10 millilamberts, or 
some subjective term, e.g., red. Such 
judgments are probably made by com- 
paring the present stimulus with some 
average subjective standard, but the 
method of obtaining judgments which 
we describe requires no explicit com- 
parisons. Stevens (8) has called this 
type of judgment séimulus rating and 
defines the problem as the determina- 
tion of the accuracy and precision with 
which physical values can be judged. 
We prefer not to use that term because 
it connotes (at least to us) the judg- 
ment of magnitudes, and absolute 
judgments can be made of qualita- 
tively different categories. 

With such judgments, whatever 
they are called, it is meaningful to ask 
certain types of questions for which 
the traditional measures of central 
tendency, dispersion, or correlation do 
not provide immediate and convenient 
answers. For example, the stimulus 
can be thought of as an event which 
occurs with a certain probability. 
The distribution of judgmental re- 
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Devices Center, Office of Naval Research, 
and The Johns Hopkins University. This is 
Report No. 166-I-137, Project Designation 
No. NR-784-001, under that contract. 


sponses of an O can be used to deter- 
mine how accurately he perceived 
which of several stimuli occurred at a 
particular time. In a sense we want 
to ask how much information an O 
obtained about which of the several 
alternative stimuli occurred, and for 
this question it would be convenient 
to have a measure of amount of in- 
formation. 

It is the purpose of this paper to 
show that a measure of the amount of 
information is useful for this type of 
problem. In addition, we shall show 
in a later part of the paper that this 
measure of information gives a direct 
indication of the minimum number of 
stimuli from a set which can convey 
or transmit the maximum information 
about some other set of events. 

Miller and Frick (3) have shown 
that there are many possible applica- 
tions of this measure of information 
and associated concepts in psychology, 
and they have explained the rationale 
of the measure. Before going on to 
our specific problem, however, we shaii 
give a brief description of the measure 
to aid in clarifying the specific appli- 
cations outlined in this paper. 

A measure of information. Briefly, 
the amount of information which is 
contained in or conveyed by the occur- 
rence of an event is a monotonic func- 
tion of the number of equally likely 
alternative events which could have 
occurred. Thus, the amount of infor- 
mation contained in a particular event 
is not determined by what happened, 
but rather by what could have hap- 
pened. For example, when a par- 
ticular stimulus occurs we get more 
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information if the stimulus is one of 
20 possibilities than if it is of two 
possibilities. In this sense, informa- 
tion is equivalent to uncertainty—the 
more uncertain we are that an event 
is going to occur, the more information 
we get when it does occur. (If we 
know that something is going to hap- 
pen, we get no information at all when 
it does happen.) Practically speak- 
ing, the terms information and uncer- 
tainty can be used interchangeably, 
although there are times when it seems 
more appropriate to use one rather 
than the other. But in terms of the 
measurements, a measure of uncer- 
tainty is the same as a measure of 
information. 

The particular measure used is the 
logarithm to the base two of. the 
number of equally likely alternatives, 
and the resulting unit is called the bit. 
This measure states, in effect, the 
number of two-choice discriminations 
which must be made in order to 
specify one event from a number of 
alternatives. t For example, if an event 
is one of ‘eight “equally likely alterna- 
tives (p = .125), the amount of infor- 
mation (J) is 3 bits. Mathematically, 

[= logs 5 =—— loge p = 3.0. 
The three two-choice discriminations 
are first to specify which of two groups 
of four, then which uf two groups of 
two, then which of the two in the last 
group. 

If the alternative events in a group 
do not have equal probabilities of 
occurrence, then the amount of infor- 
mation in the occurrence of a particu- 
lar one of the events is still computed 
by taking the logarithm of the recip- 
rocal of the probability of occurrence. 
If the amount of information con- 
veyed by the long-run average of the 
entire set is desired, however, then the 
various amounts of information are 
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averaged, but in the averaging each 
computed amount is weighted accord- 
ing to its probability of occurrence. 
Thus, 


1 
I= 2 p logs > = — L p logs p. 


The measure does not require that 
the alternative events be on a single 
continuum. Furthermore, if two 
events are different, there is nothing 
in the computation which is concerned 
with how different they are. This is 
not to say that the measure cannot be 
applied when a metric difference is 
known, but rather that the measure 
can be used whether or not the metric 
is known. For many types of quali- 
tative judgment, where it would be 
impossible to obtain a true correla- 
tion, the amount of information can 
be computed and can convey impor- 
tant implications about the data. 


CALCULATION OF THE AMOUNT OF 
INFORMATION FROM ABSOLUTE 
JUDGMENTS 


The type of data matrix we shall be 
discussing is illustrated in Table I. 
Various stimulus categories (k, num- 
bered from 1 to S) are presented in a 
random order to an O who then re- 


TABLE I 


SymMsoLic REPRESENTATION OF THE NUMBER 
oF JoINT OCCURRENCES oF STIMULUS 
CaTeGory (k) WITH RESPONSE 

Catecory (j) 
1 Bi ee Beant oe 
Nu Nu 54 Mu > + Mis MM. 
Nu Na Nu Nes N2. 
Nis Nj. 


Na Na Nix 


Nm Ne Nr Nes Nr. 
Ni Na Nu Ns N 
The number of events in cell, jk. 
The row and columh totals. 

The total number of events in the 


matrix. 
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sponds by judging the stimulus to be 
in one of the response categories (j, 
numbered from ito R). Thus wehave 
a matrix with S X R total cells, and in 
each cell is indicated the frequency 
with which a particular stimulus was 
presented and called a particular re- 
sponse. By adding down columns or 
across rows, the total number of times 
that each stimulus was presented (NV), 
and the total number of times that 
each response category was used (N;j.) 
can be determined. 

From these frequencies, the follow- 
ing probabilities can be computed: 


N; 
p(j,k) = V : 


p(j) = 


p(k) = 
pj, k) 
p(k) 


pik) | 
p() * given j. 


wei fF 
bx(j) = W, = 


Ni 
p(k) = = 


From this matrix we want to com- 
pute the amount of information an O 
gets about which of several alternative 
stimuli occurred on a given presenta- 
tion. This amount of information we 
can call information transmitted (J;). 
Before computing this quantity, how- 
ever, it is necessary to consider two 
other quantities which enter into the 
calculations, and which set limiting 
values on the amount of information 
transmitted. 

In the first place, the number of 
alternative stimuli will determine the 
maximum information transmitted, 
because an O cannot receive more in- 
formation than is contained in the 
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stimuli originally. Basically, all we 
need to know is how much uncertainty 
there is in knowing which stimulus is 
going to occur at a particular time. 
If all stimuli are presented an equal 
number of times, then the uncertainty 
is determined simply by the number 
of possible choices. If the stimuli 
occur an unequal number of times, 
then we shall have to use some 
weighted average measure. This un- 
certainty in the stimulus we can call 
the stimulus information (I,), and with 
the symbols we are using it is defined 


Probability of the joint occurrence of stimulus 
category, k, and response category, j. 


(1) 


Probability of occurrence of j. 
Probability of occurrence of k. 


: Conditional probability of occurrence of j, given k. 


Conditional probability of & having occurred, 


(5) 
as 


(6) 


I. = — F p(b) logs pl). 
t=1 


Table II is a simple matrix of hypo- 
thetical data which we shall use to 
illustrate the calculations. With this 
example four different stimuli are pre- 
sented an equal number of times. 
Thus the stimulus information is two 
bits. 

Another factor which will limit the 
amount of information transmitted is 
the uncertainty in the responses when 
the stimulus is not known. For ex- 
ample, if in our example only two 
responses were ever used, then as far 
as we can tell the O could obtain only 
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one bit of information about which 
stimulus occurred. This uncertainty 
in the responses we can call response 
information (I,), and it is defined as 


R 
i= = * P(j) log: p(j). — (7) 


In our example, the response informa- 
tion is 
I, = — [.24 logs .24 + 2(.28 logs .28) 


+ .20 log: .20] 
= 1,99, 


It is less than J, because all response 
categories were not used equally often. 

The amount of information in the 
judgmental responses indicates to a 
certain extent the number of judg- 
ment categories which can profitably 
be used in a rating scale. If, for ex- 
ample, an industrial rating scale with 
25 points were used, but the distribu- 
tion of ratings showed that only three 
bits of information were obtained from 
the judgments (indicating that all 25 
points were not used equally often), 
then as much scale efficiency should 
be possible with only eight points as 
with the original 25. 

Information transmitted. In order 
to calculate the amount of information 
transmitted, we can look at the prob- 
lem from three different points of view. 
We can deal with the stimulus infor- 
mation as the limiting factor, with the 
response information as the limiting 
factor, or we can deal with the entire 
matrix of cell probabilities. These 
three ways of looking at the problem 


TABLE ITI 


TABLE OF JoznT OcCURRENCES OF STIMULUS 
(Rk) AND RESPONSE (j) CATEGORIES 


Nj. 
96 
112 
112 
80 
400 


Tase III 


1 2 3 


‘ 20 
4 68 20 
c) Sie ieee 
12 68 
O 2 2 RM 


* The cell entries are the p:(j). 


result ‘in three different methods of 
calculation, but since they are mathe- 
matically equivalent the technique 
used may depend only on the form in 
which data happen to be set up. 

(1) Table III has been set up for 
what is probably the easiest method 
of calculation.! The logic of this 
method is as follows: The maximum 
amount of information which can be 
transmitted is determined by the re- 
sponse information (J,). From this 
quantity, however, a term must be 
subtracted to account for the fact that 
each time a stimulus is presented, the 
same response does not always occur. 
If a given stimulus does not always 
produce the same response, then the 
amount of information transmitted is 
less than J,, What we have to sub- 
tract is the uncertainty about which 
response will occur when we know 
which stimulus has been presented. 
This uncertainty must be computed 
for each stimulus separately, and a 
weighted average determined for all 
stimuli taken together. We can call 
this uncertainty response equivocation 
(Z,) and define it as 


E, = — 5 p(k) E pal) logs pal). (8) 
k=1 imi 


The information transmitted, which 
Laemmel (2) has called intelligence, is 
the response information minus the 
response equivocation. Thus, 


I; = J, — E,. (9) 


'Newman (4) has published a table of 
—p log: » for p from 0.001 to 0.999. 
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If all stimuli occur equally often, 
then 


8 R 
E,= —25 E pal) logs pal). (10) 
k=l j=l 


For the example, 


+ +++ + 32 log, .32 
+ .68 logs .68] 
= 1.03 
and 
I, = 1.99 — 1.03 = 0.96. 


(2) Table IV has been set up for the 
second method? of calculation. This 
method is basically the same as the 
first, except that the.stimulus infor- 
mation (J,) is taken as the limiting 
value, and from it is subtracted the 
stimulus equivocation when the re- 
sponse is specified. Thus the propor- 
tions shown in the cells in Table IV 
are the proportion of times that a par- 
ticular stimulus had been presented 
where a particular response category 
was used. The amount of equivoca- 
tion within each row (E,), weighted 
according to the frequency that a par- 
ticular response is used, is subtracted 
from the amount of stimulus informa- 
tion (J,) to obtain the amount of 
information transmitted. We have 


R 8 
E, = — © (7) & pik) logs pk) (11) 
j=1 k=1 
= — [.24(.79 log,.79 + .21 logs .21) 
+ +++ +.20(.15 logs .15 
+ .85 log: .85)] 
= 1.04 


and 
I=I,-£, 
= 2.00 — 1.04 = 0.96. (12) 


These first two methods are identi- 
cal excépt that in the first case we deal 


? This method follows that of Laemmel (2) 
for computing the amount of transmitted 
intelligence and that of Shannon and Weaver 
(7) for describing the rate of actual trans- 
mission in communication systems. 


TaBLe IV 
1 2 3 4 pf) 
1 .79* 21 .24 
2 21 61 .18 .28 
3 ll 61 .28 .28 


4 5 85 -20 
p(k) .25 25 25 25 


* The cell entries are the p;(k). 


with probabilities within columns and 
in the second we dea: with probabili- 
ties within rows. Superficially, the 
technique is very similar to that used 
in an analysis of variance, but no 
simple mathematical relation between 
the two types of computation exists. 

(3) The third method of calculation 
is the most basic of the three, and both 
of the other techniques can be derived 
from it. Table V has been set up for 
this method of calculation. With this 
method, the matrix is treated as a 
group of individual cells rather than 
as sets of rows and columns. The 
basic logic of the method is this: If no 
information were transmitted to the re- 
sponses by the stimuli, then the cells 
would have frequencies determined 
by chance. The chance frequencies 


TABLE V 

i 2 3 4 pli) 

19* 05 00 00 24 
; .06** 06 06 06 

.06 17 05 00 .28 
: .07 07 07 07 

.00 03 17 .08 .28 
: 07 07 07 07 
. .00 .00 03 17 -20 


05 05 05 0S 


p(k) 25 25 «2525 


* The top entry in each cell is the (j,k). 
** The bottom entry in each cell is the 
P(j,k), where P(j,k) = p(j)p(k). 


anf 
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in each cell are influenced only by 
the row and column totals in exactly 
the same way as for Chi-square. 
The amount of information computed 
from these expected cell probabilities 
(P(j, k)) is the maximum uncertainty 
which can be obtained in the matrix 
with the given row and column totals; 
and if this uncertainty were actually 
obtained, then no information has 
been transmitted about which stimuli 
had occurred. Theactual uncertainty 
obtained is computed from the cell 
probabilities as they occurred. This 
figure will be less than the maximum 
because there always is less uncer- 
tainty about where frequencies will 
fall into cells as long as there is any 
correlation between stimuli and re- 
sponses. Thus we have a maximum 
uncertainty, and an obtained uncer- 
tainty, and the difference between 
them is the information transmitted. 
The extent to which the maximum 
uncertainty has been decreased by the 
correlation between stimuli and re- 
sponses is the extent to which infor- 
mation has been transmitted. We 
have,? 

* Equations (9) and (12) can be derived 
from equation (13) as follows: 


First, 
P(j,k) = p(j)p(k) 


and substituting in the first part of equation 
(13), ve have 


R 8 
—Z = p(j)p(k) logsp(j) p(k) 
im. km 
RS 
= — 2 J p(s)p(k) logs p(j) 
im] 1 
Rg 8 
— 2 Z pj) p(k) logs p(k) 
im] kml 
8 RK a 
= —[2 p(k) [2 ply) log: p(s) ] 
k~i i~1 


R 8 
— C2 PICS p(k) logs p(k] 
im! kl 


R 8 
= — Z p(j) logs p(j) — Z p(k) logs p(k). 
jm to | 


L= -E EPG.) logs PG #) 


j=l 


+E Epi) logep(j.k) (13) 
j=l b=1 


= — [4(.06 log, .06) 
+ 8(.07 logs .07) 
+ 4(.05 logs .05) ] 
+ [.19 logs .19 + .05 log: .05 
+ bs hee + .03 loge .03 
+ .17 logs .17] 
= 3.99 — 3.03 = 0.96. 


INTERPRETATION OF THE MEASURE 


In order to interpret the informa- 
tion transmitted (J;), it is convenient 


By equations (6) and (7) 
band I + 5 
Thus the total information or uncertainty in 
the matrix of expected cell frequencies is 
equal to the sum of the information in the 
stimuli and in the responses. 
PUR) = plk)pe(7) = P(7) pile) 


and substituting in the second part of equa- 
tion (13), we have 


RS 
— = 2 p(k) pel) loge p(k) pe(7) 
fut k=1 
Rk 8 
= —Z Z pr(j)plk) logs p(k) 
im] bmi 
R8 
— Z Z plk)pe(j) logs pe(Z) 
int b=1 
8 
= — & p(k) logs p(k) 
kt 


8 R 
— Z p(k) J peli) loge pe(J) 
—. jmt 


= I, + &,. 


Thus the information or uncertainty in the 
matrix of obtained cell frequencies is equal to 
the sum of the information in the stimuli plus 
the response equivocation. In like manner, 
it can be shown to equal (J, + E,). Equa- 
tions (9) and (12) are derived as 
L=I,+,—-I,—-E, 
=I,—E, (9) 


.=1,+1,—I,-E, 
=I,— &,. 


and 


(12) 











452 W. R. Garner AND Harotp W. HAKE 


to think of the stimulus as being a 
symbolic representation of a series of 
possible events, as illustrated in Fig. 1. 
Here we have a series of possible 
events (which need not be on a true 
continuum), and we are representing 
ranges or classes of events with a single 
stimulus value. For example, all 
events between EZ, and £; are repre- 
sented as the stimulus value S;. This 
5S, is then responded to as Ri, which 
may be anything from a numerical 
report toacourseofaction. (Itshould 
be remembered that a stimulus may 
represent a class of events which are 


stimuli of the same kind.) We have 


complete freedom in deciding which 
stimulus value will represent which 
class of events, and there need not be 
a one-to-one correspondence between 
the events and the stimulus represen- 
tation. In fact, as we shall point out 
later, there usually should not be a 
one-to-one correspondence. But we 





shall nearly always want to have a 
one-to-one correspondence between 
the stimulus and the response. 

When we talk about the ameunt of 
information transmitted, we refer to 
the amount of information about the 
event continuum which a particular 
range of stimulus values can transmit 
to an O. It does not indicate how 
much information is in the event con- 
tinuum, or how much could have been 
transmitted with some other type of 
symbolic representation or with some 
other range of stimuli. Two examples 
will serve to illustrate the difference 
between this situation and other situa- 
tions which are superficially similar. 

Suppose that colors are used to rep- 
resent positions on an airport surface, 
and we do an experiment in which 20 
different colors are judged by color 
naming. From the data matrix we 
compute that three bits of information 
were transmitted by the colors. This 











E sees PR eS a 
4 Ss, > Rs, 
Es 
“SMBEC 
C, 
g---2-- re 
E, 
Eo re: # 
EVENT STIMULUS RESPONSE 
CONTINUUM CONTINUUM CONTINUUM 


Fic. 1. The relation between the event, the stimulus and the response continua. A group 
of events is represented by a single stimulus, and in the ideal case a one-to-one relation between 
the presented stimulus and the verbal response or action holds. Which events (or how many) 
are represented by which particular discrete stimulus is an arbitrary matter. 
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figure means that just as much infor- 
mation about positions on the surface 
of the airport can be transmitted by 
the use of eight colors as by the use 
of the original 20. Thus, there is no 
inefficiency in the use of the smaller 
number of colors. 

There are two factors which make 
this efficiency with the smaller number 
possible. The first is that we can 
select any discrete values of stimuli; 
we are not simply pooling stimulus 
values. The second factor is that an 
exact relationship between surface po- 
sitions and the color used to represent 
the position can be: maintained. In 
other words, there is no error in the 
assignment of the color code to the 
positions on the airport surface. If 
such error did exist, then the use of 
fewer stimulus categories would result 
in less information transmitted. 

The numerical example we have 
been using will illustrate this point. 
We started with four stimuli and four 
responses, but computed that approxi- 
mately one bit of information was 
transmitted. With the interpretation 
we have given, we should be able to 
transmit just as much information 
with only two stimuli and two re- 
sponses. Reference to Table I] shows 
that if only the first and fourth stimuli 
were used, with one response category 
for each, we would have no errors for 
the 2 X 2 matrix. This situation 
would give one bit of information 
transmitted. All the events previ- 
ously represented by the second stim- 
ulus would now be represented by the 
first stimulus, and there would be no 
loss of information. Although the 
coding was coarser, fewer errors would 
be made, and the information trans- 
mitted would remain the same. 

It is clear that our real criterion is 
the correspondence between the event 
continuum and the response contin- 
uum. We are asking how much of 
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the information in the event con- 
tinuum is represented by the response 
continuum. There are other ways of 
measuring the correspondence between 
the two when the event continuum is 
quantitative. A product-moment cor- 
relation coefficient, or a total error 
variance analysis could indicate the 
degree of relationship between these 
two scales. It may not be immedi- 
ately obvious from the relations we 
have stated that in fact the precision 
with which the responses represent 
the event continuum will not become 
poorer if fewer categories are used, but 
experimental evidence for this fact will 
be presented by the authors elsewhere. 

As a second illustration, suppose 
that ratings on a scale of efficiency are 
obtained from judges about a group 
of eight men. The stimulus-response 
matrix looks as it did before, and we 
compute that two bits of information 
have been transmitted. Now, how- 
ever, we cannot say that we could 
have obtained just as much informa- 
tion about the relative efficiency of the 
eight men if we had just used four 
men, We want information about all 
of the men, and we have no means of 
pooling the men into groups to be 
represented by just one man with no 
error in the pooling. Also, we cannot 
use a rating scale of four steps, because 
it would be too coarse to allow us to 
rate some of the men accurately. The 
main point of distinction is that the 
information transmitted does not tell 
us that data can be pooled. The 
events which are being represented by 
the stimuli can be pooled (if done 
accurately) and each group repre- 
sented by a discrete stimulus. But 
if stimuli and responses are simply 
pooled into groups, information will 
be lost as iong as there is some corre- 
lation between stimuli and responses 
within the groups. Again, in Table 
II, if we pooled stimuli and responses 
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into two sets each, there would still 
be some errors, and the amount of in- 
formation transmitted would be some- 
what less than one bit. 


THE CONTINGENCY COEFFICIENT 


There is one statistical measure 
commonly used by psychologists which 
has many of the same properties as 
the measure of information. It is the 
contingency coefficient (C). Normally 
it has been used and interpreted as a 
correlational measure for non-metric 
data, and its use for this purpose per- 
haps has prevented some of its other 
uses becoming apparent. 

If the proportions in Table V were 
changed to frequencies, then we would 
have a table from which Chi-square 
and C could be computed. We have 
computed C? for the data in our ex- 
ample, and it is 0.56. This coefficient 
can be interpreted in much the same 
way as the amount of information. 
For a matrix of a given size (number 
of rows and columns) there is a maxi- 
mum C? which can be obtained. This 
maximum is (k — 1)/k, where the 
number of rows (k) is the same as the 
number of columns. (An exact limit 
for cases in which the number of rows 
and columns is not equal has not been 
determined.) Thus for a 2 X 2 mat- 
rix, the maximum C? is 0.50. Ifa C 
of approximately 0.50 is obtained from 
a matrix larger than 2 X 2, then the 
matrix can be decreased by selecting 
certain stimulus categories, and by 
grouping corresponding response cate- 
gories, and the correlation will not 
appreciably change. Thus if C* is 
known, the minimum number of rows 
and columns which would allow that 
value to be obtained can be deter- 
mined. This number is the number 
of stimulus categories for which prac- 
tically perfect response correspondence 
can be obtained. Thus C can be used 
to obtain an answer to the type of 
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question with which we are concerned. 

The fact that there is a limiting 
value of C has frequently been con- 
sidered a disadvantage of the measure. 
Actually, the implications of this limit 
are the same as the implications of the 
amount of stimulus information in 
limiting the amount of information 
which can be'transmitted. Ina2 X 2 
matrix, the! maximum information 
transmitted is one bit, and the maxi- 
mum C? is 0.50. If only the two 
diagonal cells are filled in the matrix, 
these maximum values will be ob- 
tained. The true interpretation is 
not that this is the maximum corre- 
lation which could be obtained, or the 
maximum amount of information 
which can be transmitted. Rather, 
the interpretation should be that with 
these few categories it is impossible to 
demonstrate a greater C, or a greater 
capacity for information transmission. 

Although both the amount of infor- 
mation and C are computed from the 
same type of data matrix, and al- 
though the beginning steps are the 
same, the two are not mathematically 
equivalent. If the distribution of 
errors of judgment is known, then an 
equivalence between the two measures 
can be worked out. The two meas- 
ures respond to sampling errors in 
different ways, however, so that for 
finite and reasonably small samples 
one measure cannot be derived directly 
from the other. On the other hand, 
with a Gaussian distribution of errors, 
for every amount of information com- 
puted, a C can be estimated. This 
estimation can be used to determine 
approximate probabilities for rejection 
of the null hypothesis. 

The amount of information seems 
to us to be a better measure for sev- 
eral reasons. First, it is considerably 
easier to calculate. Second, it is more 
directly interpretable in terms of num- 
ber of categories. Third, the limiting 
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value of C approaches an asymptote 
of 1.0 as the size of the matrix is 
increased, while amount of informa- 
tion can become indefinitely large. 
Finally, C is too apt to be interpreted 
as a correlation measure. It does not 
suggest new ways to attack problems, 
or new interpretations of data as easily 
as the information theory context does. 

Another application. The similarity 
between these two measures, how- 
ever, suggests another application of 
the measure of amount of information. 
It is as a preliminary determination of 
the number of categories necessary to 
get a correlation as large as the data 
can produce. For example, if in a 
preliminary test of the relation be- 
tween magnitude of GSR and light 
intensity we get a C? of 0.50, or two 
bits of information in the matrix when 
more than two intensities are used, 
then we know that we can demon- 
strate the maximum correlation by 
using only two light intensities (the 
maximum and minimum) and by 
lumping all GSR magnitudes into two 
categories. The correlation will not 
be decrease! by doing this, and the 
amount of information about the rela- 
tionship wiil not decrease. If for 
practical reasons it is desirable to use 
fewer categories, and perhaps more 
cases in each category, then we know 
that we shall not have lost any infor- 
mation in so doing. 


SELECTION OF THE STIMULI 


Computation of the amount of in- 
formation transmitted indicates the 
minimum number of stimulus cate- 
gories which can be used to obtain 
the maximum amount of informa- 
tion about a given set of possible 
events. The computation, however, 
does not indicate which particular 
stimuli should be used. There are 
two major criteria in selecting the par- 
ticular stimuli: The first is to have 
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stimuli which Os will tend to cate- 
gorize by using each response alterna- 
tive as often as each corresponding 
stimulus is presented. If relatively 
few categories are used, there is prob- 
ably little difficulty in maintaining 
this criterion by instruction. The 
second criterion is to have as few 
errors as possible. 

If the stimuli are on a true con- 
tinuum, such as brightness or loud- 
ness, then we want to select stimuli 
which are equally discriminable from 
each other. If we assume that errors 
of judgment of loudness, for example, 
are distributed according to the Gaus- 
sian curve of errors, then the fewest 
total errors will occur if the stimuli are 
selected to be an equal number of 
standard scores apart. For example, 
suppose we have two stimuli locating 
the end-points of our scale, and we 
want to place a third stimulus in be- 
tween in such a manner as to minimize 
the judgmental errors. If we place 
the third stimulus ha'*vay between 
the other two on a sci. of discrimi- 
nability, then errors will se minimized. 
If the third stimulus is moved closer 
to one of the other two, errors of con- 
fusion will increase between those two 
at a greater rate than they decrease 
between the middle stimulus and the 
other end-point. With other types of 
error distribution, of course, this rule 
might not hold. 

We can select stimuli according to 
these criteria by inspection of the data 
and perhaps a little trial-and-error. 
It would be better, however, if we 
could determine the psychological 
properties of the stimulus continuum 
in such a manner that we could make 
a more exact prediction of the stimuli 
which should be selected, and for 
many different circumstances. Basi- 
cally we need a scale of equal dis- 
criminability. A scale of integrated 
difference limens could be the atiswer, 
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but difference limens are obtained from 
quite a different judgmental situation. 
If we could assume that judgments of 
the type we have been talking about 
are made as equal-interval judgments, 
and that the criterion for equal inter- 
vals is discriminability, then we could 
use the scale computed directly from 
the responses as our scale of equal 
discriminability. Two previous au- 
thors, however, have showr that abso- 
lute judgments are not made with all 
response categories used as equal dis- 
criminability steps. Usually judges 
discriminate between the end cate- 
gories more than between the middle 
categories. Saffir (6) first pointed out 
a method for correcting the scale and 
establishing a true scale of equal dis- 
criminability. He called his technique 
the method of successive intervals. 
Later Attneave (1) developed a tech- 
nique which he called the method of 
graded dichotomies. 


CONSTRUCTION OF A SCALE OF 
EQuaL DISCRIMINABILITY 


This section describes a third tech- 
nique for constructing a scale of equal 
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ciscriminability. The scale values ob- 
tained are basically the same for all 
three techniques, although the differ- 
ent techniques react differently to 
errors of sampling. 

The computations will be illustrated 
with data obtained in an experiment 
which required absolute judgments of 
the loudness of tones presented once 
every six seconds. There were twenty 
stimulus intensities ranging from 5 to 
100 db S.P.L., in 5 db steps. Judg- 
ments were required on a 21-point 
scale (0-20), where ‘‘0” meant that a 
tone was not heard, and ‘‘20’’ meant 
it was the loudest in the series. The 
computational steps are as follows: 


(1) Arrange the data in a matrix of the 
form illustrated in Tables I and II. 

(2) Cumulate the frequency of responses 
by successive response categories for each 
stimulus separately. For example, the 
stimulus intensity of 60 db produced 
judgments at response categories of 3, 4, 
5, 6, and 7 with frequencies of 32, 35, 15, 
11, and 7. The cumulated frequencies 
are 32, 67, 82, 93, and 100. Thus the 
number 93 means that this particular 
stimulus was called ‘6’ or some lower 
number 93 times. 


















Fic. 2. Cumulative frequency of loudness judgments expressed in standard scores. Each 
curve is for a different stimulus intensity, starting with the lowest intensity on the left. 
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Fic. 3. Cumulative frequency of loudness judgments expressed in standard scores after 
the spacings of the response categories have been adjusted for equal discriminability. The 
data are the same as in Fig. 2, but have been plotted with the adjusted scale of response cate- 
gories. The actual response category is shown at the top. The R-scale value for the cate- 


gories is shown at the bottom. 


(3) Convert the cumulative frequencies 
into proportions of times that each stimulus 
is called a particular response category or 
lower. In the example given, since the 
total presentations of each stimulus were 
100, the proportions are: .32, .67, .82, .93, 
and 1.00. 

(4) Convert the cumulative proportions 
into their equivalent standard scores. In 
the example these become —.47, +.43, 
+.92, and +1.48. No standard score 
can be determined for a proportion of 
1.00, of course. 

These cumulative frequencies expressed 
in standard scores for all 20 stimuli are 
plotted in Fig. 2. It is apparent from 
this plot that all response categories were 
not used with equal discrimination, since 
the curves for the lower intensities are 
much steeper than those for the higher 
intensities. What is needed is some 
means of adjusting the positions of the 
response categories so that all curves will 
have the same slope. If the errors are 
normally distributed, then all curves will 
also be straight lines. The technique 
used is to determine the change in the 
ordinatc for a unit change in the abscissa, 
and to use this measure of the slope to 
determine the relative spacings for the 
response categories. 


(5) Compute the average difference in 
standard scores for each successive pair of 
response categories. For the particular 
stimulus we have been using as ar illus- 
tration, the difference between response 
categories 3 and 4 is .90, and that be- 
tween 4 and 5 is .49. However, the 
curves for four other stimulus intensities 
also can be used to compute the differ- 
ence for categories 3 and 4, and three 
others for the difference between 4 and 5. 
For each successive pair of categories the 
average of all differences which can be 
computed is determined. The number 
that can be averaged will vary depending 
on which pair of response categories is 
being considered. 

(6) Arbitrarily assign the lowest response 
category a value of 0, and then add the suc- 
cessive differences to obtain the values for 
each successive category. Anew scale (the 
R-scale) value has now been determined 
for each category, and a plot of the cumu- 
lative curves with these new scale values 
is shown in Fig. 3. Now all curves are 
essentially straight lines with the same 
slope. 

(7) Determine the R-scale value equiva- 
lent to the median response for each 
stimulus intensity. It is easiest to do the 
calculations from the cumulative curves 
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Fic. 4. The equal-discriminability scale values as a function of stimulus intensity for the 
data shown in Fig. 2. The E.D.-scale value for each intensity is determined from Fig. 3. 
It is the R-scale value which (for each stimulus intensity) corresponds to a standard score of 0 


on the ordinate. 
a particular stimuius intensity. 


as shown in Fig. 3, and any of a number 
of techniques can be used: Visual esti- 
mation, linear interpolation, method of 
averages, least squares, etc. Nowascale 
value has been determined for each stim- 
ulusintensity. The final scale computed 
with this technique is shown in Fig. 4. 
We have called it an equal-discrimina- 
bility (Z.D.) scale, 

It should be pointed out that the final 
scale is one in which equal distances on 
the E.D.-scale represent equal extents of 
discriminability or confusability. The 
correction of the response scale has re- 
sulted in a scale in which equal distances 
on the £.D.-scale represent equal tend- 
encies to call two stimuli the same re- 
sponse. Thus any peculiar use of the 
response categories has been ruled out. 
For example, occasionally Os refuse to 
use particular categories. The final scale 
computed has cancelled out this effect by 
being only a measure of the extent to 
which two stimuli are called the same 
response, regardless of what that response 
is. Furthermore, neither the R-scale nor 
the £.D.-scale has an absolute zero. 
Thus the scales have only the properties 
of equal-interval scales, not of ratio scales. 

If we now wanted to select just four 
stimuli from this range of intensities, we 


In other words, it is the R-scale value equivalent to the median response for 


would select 5,40, 82, and 100 db, since 
these four stimuli are equally spaced on 
the E.D.-scaie. 


Practical considerations. Frequently 
practical considerations require a num- 
ber of stimuli greater than the mini- 
mum number necessary to get maxi- 
mum information transfer. In such 
cases the proper selection of the stimuli 
is even more important if information 
transfer is to be maximized. There 
is one particular situation in which 
proper selection of the stimuli by using 
the E.D.-scale can partially offset some 
disadvantages produced in the original 
event continuum. 

Suppose that practical considera- 
tions require us to use a number of 
categories sufficiently large that we 
shall always have errors of judgment. 
Suppose also that the events which 
are being represented do not occur 
equally often. It is apparent from 
the calculation formulae that an un- 
equal probability of occurrence will 
decrease information transfer. How- 
ever, this effect can be partially offset 
by spacing stimuli far apart when they 
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are used frequently, but closer to- 
gether when they are used infre- 
quently. Thus the probability of 
error is decreased for frequent occur- 
rences at the expense of an increased 
probability of error for infrequent 
occurrences. For every set of event 
probabilities, there should be some 
optimum spacing of the stimuli on the 
E.D.-scale. The precise determina- 
tion of the optimum spacing for a 
particular set of probabilities is a com- 
plicated calculation. But it should 
be possible to work out some rela- 
tively simple rules-of-thumb to handle 
these cases. 

Ideally, of course, we would prefer 
to avoid this type of problem by 
grouping the various events into sets 
which have equal probabilities of oc- 
currence. Thus all stimuli would 
occur equally often, and the system 
would produce maximum information 
transmission. As a practical matter, 
however, other considerations will fre- 
quently determine the groupings. The 
next best thing is to compensate for 
the unequal probabilities as we have 
suggested. 

SUMMARY 


1. With absolute judgments, or 
stimulus ratings of stimulus cate- 
gories, the judgments of an O indicate 
how accurately the O perceived which 
of several alternative stimuli occurred 
on a particular presentation, or how 
much information the O obtained 
about which stimulus occurred. 

2. The amount of information which 
the stimuli transmit to an O can be 
measured in bits. 

3. This measure gives an estimate 
of the minimum number of stimulus 
categories which can be used to trans- 
mit the maximum amount of informa- 


‘4 This is essentially the problem of coding 
against noise in communication systems by 
the adjustment of relative input rates. This 
problem is discussed by Laemmel (2) and 
Ruark (5). 
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tion in those cases where the stimulus 
is a symbol for a group of events, and 
where the coding of the events to the 
symbol can be done with no error. 

4. The contingency coefficient has 
some of the same properties as the 
amount of information. 

5. In order to select the stimuli for 
maximum information transmission a 
scale of equal discriminability is re- 
quired, and a technique for construct- 
ing such a scale is described. 

6. The scale of equal discrimina- 
bility can also be used to select stimuli 
which will partially compensate for 
an unequal frequency distribution of 
the events being represented. 
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PERCEIVED SIZE AND DISTANCE IN VISUAL SPACE 


BY ALBERTA S. GILINSKY 


In a recent survey of the status of 
research on visual perception, Graham 
(8) points out that the facts of space 
discrimination are still poorly under- 
stood. “In particular a long series of 
researches has not resulted in those 
systematically determined functional 
relations which, as Holway and Bor- 
ing (1941) say, are ‘wanted and 
wanting’”’ (8, p. 876). 

The purpose of this study is to 
develop a quantitative formulation of 
visual space perception expressing 
such functional relations. Functions 
relating visually perceived size and 
distance to true size and distance are 
rationally derived and then applied 
to the results of a variety of visual 
size and distance experiments. 

The theory is expressed quantita- 
tively in two simple, interrelated 
formulas—for perceived distance and 
perceived size, respectively. These 
two formulas are derived mathemati- 
cally in three different ways: First, the 
two formulas are rigorously derived 
from the basic metric of visual space 
as established mathematically (for 
binocular vision) by Luneburg (12). 
Second, the same two formulas are 
mathematically derived (somewhat 
less rigorously but without restriction 
to binocular vision) from the known 
principles of visual perspective. Fi- 
nally, the same two formulas are de- 
rived by a simple inductive method of 
mathematical composition for the 
two boundary laws of size constancy 
and retinal image (visual angle). 
All three methods of derivation yield 
the identical pair of formulas to ex- 
press a unifying law of visual space 
perception. 

In place of three different observer 
constants and several unstated scale 





constants contained or required to be 
supplied in Luneburg’s formulas, the 
present simplified formulas contain 
essentially a single constant or param- 
eter, A. This parameter, A, varies 
for different observers and with the 
conditions of -the experiment. The 
numerical value of A for a given 
observer may be expected to depend 
upon the availability of cues to dis- 
tance. Through the perfection of 
techniques whereby the value of A 
may be reliably determined for indi- 
vidual observers and for different 
conditions, the two formulas can be 
made into a useful tool to enable indi- 
viduals to estimate true sizes and dis- 
tances from visual observations at a 
distance. A practical measure or 
specification may also thus be made 
available for classifying or grading 
individuals with respect to their dis- 
tance judgment or spatial discrimina- 
tion. 

The general validity of the two 
formulas is supported by visual size 
and distance experiments planned 
and conducted for this study. They 
are also checked by visual size experi- 
ments by prior investigators (9, 16) 
and other observations recorded in 
the literature, such as perceived size 
in stereoscopic photographs (18), the 
alley experiments (2), and the visual 
speed of moving objects (5). A 
further striking check (seemingly un- 
predicted) is the applicability of the 
same distance-formula to perceived 
distance determined by auditory stim- 
ulation (17). This variety of evidence 
indicates that the derived relations 
and the unifying law which they ex- 
press may have considerable general- 
ity. 

Visual Space. We open our eyes 


460 





PERCEIVED SIZE AND DISTANCE IN VISUAL SPACE 


and perceive a_ three-dimensional 
world in which objects have perceived 
size, form, and localization. Our 
experience is called visual space. ° 

Visual space and physical space are 
not identical. Neither is visual space 
in three dimensions a proportional 
replica of objective space in three 
dimensions. One is a distorted trans- 
formation of the other. As Ogie has 
stated: 


“. . . an object in objective space may 
be displaced without deformation and 
without changes between related points, 
. . . in visual space, this is not true. 
As an object recedes, not only does it ap- 
pear to become smaller but its shape 
appears to change, this change being 
greater in the depth extent than in 
height or width. Many other dis- 
crepancies between the two spaces have 
been established. . . . It is only impor- 
tant that, whatever relationship exists 
between objective and visual space, it 
should be fairly stable if the individual is 
to act effectively in the physical world” 
(13, p. 11). 


In the prior search for a simple la 
of visual space perception, two princi- 
pal theories have been held—each 
incompatible with the other, and 
each beyond certain limits becoming 
inconsistent with known phenomena 
and relations. 


1. At comparatively short distances, 
well within the range of accustomed ex- 
perience or confirmation by other senses, 
the organism tends to utilize corrective 
cues so as to compensate for the distance 
and the corresponding diminution of 
retinal size. Observations under such 
limiting conditions have given rise to 
the so-called law of siz: constancy, 
whereby the perceived size of an object 
is taken as constant and independent of 
retinal size (1, 8, 9). 

2. At the other extreme, if size of 
retinal image is assumed to be the sole 
cue to perceived size, and if all depth or 
distance perception is ignored, the so- 
called law of the retinal image or visual 
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angle is obtained, whereby the perceived 
size of an object is taken as simply pro- 
portional to the size of the retinal image 
(or visual angle) and therefore inversely 
proportional to the distance of the ob- 
ject. This theory is more nearly true at 
very great or astronomica! distances 
(1, 3, 8, 9). 


Neither the law of the retinal image 
nor the opposite theory, the law of 
size constancy, can be sustained over 
the entire range of distances and condi- 
tions (2,4,8,9). As Boring (2, 4) 
has shown, each view describes only 
a special limiting case. An adequate 
general theory must be capable of 
satisfying these boundary conditions 
and of providing, as well, for a smooth 
continuous transition over the large 
intervening range between the extreme 
boundary laws. The aim is to find a 
unifying law applicable to all dis- 
tances and to all conditions of visual 
perception. 

The various theories of perceived 
size may also be compared by relating 
them to another concept—“the pro- 
jection theory of visual perception” 
(10,12). If the two base points are 
the centers of rotation of the eyes and 
the projection lines are the optical 
axes, the intersection of the two pro- 
jection lines will be at the actual 
point fixated and therefore at the true 
physical distance D. If the retinal 
image is projected to this true physi- 
cal distance D, the perceived size will 
be undiminished by distance and the 
law of size constancy is obtained. If, 
on the other hand, the retinal image 
is always projected to an arbitrary 
constant distance, independent of 
true distance, the perceived size will 
be simply proportional to the size of 
the retinal image, without regard to 
distance, and the law of the retinal 
image is obtained. As stated above, 
neither of these two extreme laws is 
adequate. 
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The perception of distance or depth 
is a fact, and must be included in a 
generalized theory. But that does 
not mean the true or physical dis- 
tance D, for physical distance is not 
perceived. What is sensed or per- 
ceived is the perceived distance d, and 
that is the only distance that is sub- 
jectively realized and therefore the 
only distance at which the projection 
can, logically and consistently, be 
made. If the size (in radians) of the 
retinal image or visual angle is g (phi), 
and if the perceived distance is d, 
then the perceived size s, is given by 
the simple relation 


(1) 


s = o-d. 


According to our hypothesis, this is 
the basic, unifying relation between 
perceived size and perceived distance. 

The significance of this concept for 
our understanding of space perception 
is that it unifies a large number of 


previous observations, embracing 
both size constancy and retinal image 
as limiting cases, and bridging the 
gap between these two extremes. 
Perceived distances are foreshort- 
ened. The perceived distance d in- 
creases with the true distance D bui 
at a reduced and diminishing rate. 
Consequently the law of perceived 
size given by equation (1) is inter- 
mediate between the law of retinal 
image and the law of size constancy. 
Moreover, the perceived distance 
d approaches a finite limit A when the 
true distance D increases to infinity. 
As Luneburg has stated: “Astro- 
nomical objects like the sun or the 
moon are seen at finite distances; 
their sensed size is also finite and in no 
way propertionai te astronomical 
dimensions. Even the sky itself zives 
the impressioa ot a deme of finite 
radius” (12, ». i‘ 
As thus powied cut by Luneburg, 
and earlier, by James (10), there 
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exists a maximum limit of perceived 
distance. Whereas physical space is 
infinite, visual space is finite. This 
concept of an upper limiting thresh- 
old (or limen) for spatial discrimina- 
tion or distance perception is consist- 
ent with the known facts of spatial 
discrimination and with sensory dis- 
crimination in general (8,19). The 
question as to what physiological 
mechanism or receptor process governs 
or underlies this maximum limit of 
visual distance is immaterial for our 
purpose. 

Luneburg was principally concern +d 
with investigating the geometric. 
character (hyperbolic, Euclidian, or 
elliptic) of the tridimensional mani- 
fold comprising visual space. He 
shows that “the geometry in any. 
manifold can be derived from its 
metric, i.e., from a rule for measuring 
small line elements.” The problem 
is thus “‘to establish a metric for the 
manifold of visual sensations” (12, 
p. 2). The basic metric given by 
Luneburg establishes the general rela- 
tion between the two spaces, visual 
space and physical space. Using 
Luneburg’s basic metric as our point 
of departure, we shall derive our own 
psychometric coordination, based on 
the Euclidian geometry, for formulat- 
ing the relation between visual and 
physical space. 


DEFINITIONS: PERCEIVED SIZE 
AND DISTANCE 


The real objective size of a physical 
object is subjectively unknowable. 
We can, however, apply the familiar 
physical and geometrical operations 
of measurement, starting with the 
simple relation of equality, so as to 
determine the relative physical sizes of 
objects. A standard unit measuring 
rod supplies the objective scale. The 
objective size measured on this scale 
will be designated by So. 
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Similarly, the subjective size s of 
visual objects cannot be objectively 
determined. We can, however, apply 
various procedures of matching and 
comparison, limited to an individual 
observer and constituting psycho- 
physical measurement. Starting with 
the simple relation of equality, we may 
thus determine (for a given observer) 
the relative perceived sizes of different 
objects, and also the relative per- 
ceived sizes of the same object at dif- 
ferent distances. Moreover, we can 
select one of these viewing distances as 
“the normal viewing distance’ 6 
(delta) and we can call the correspond- 
ing perceived size of the object the 
“true size” S to be used as a standard 
reference size for the visual perception 
of the object. This concept has been 
anticipated by William James: 


“Out of all the visual magnitudes of each 
known object we have selected one as the 
REAL one to think of, and degraded all 
the others to serve as its signs. This ‘real’ 
magnitude is determined by aesthetic 
and practical interests. It is that which 
we get when the object is the distance 
most propitious for exact visual dis- 
crimination of its details. This is the 
distance at which we hold anything we 
are examining. Farther than this we 
see it too small, nearer too large. And 
the larger and the smaller feeling vanish 
in the act of suggesting this one, their 
more important meaning” (10, p. 179). 


The physical measuring rod used for 
objective size becomes also the meas- 
uring rod for subjective size provided 
the rod is held at the normal viewing 


distance 6 (delta). We assume equal- 
ity of visual size when object and 
measuring rod have equal physical 
size and are held at identical viewing 
distance. 

When the object is mcre remote, 
we hold the measuring rod at the 
normal viewing distance 6 and then 
transfer its visual size to superposition 
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upon the object at its perceived dis- 
tance d. The equivalent procedure 
is to transfer the visual size of the 
distant object to the measuring rod 
(or comparison standard) held at the 
normal viewing distance 6. This 
technique must be included in the 
instructions to the observer and must 
be learned by practice. In each case 
a recalled visual size must be com- 
pared with an observed visual size. 
The two visual sizes (object and 
standard at disparate distances) must 
not be compared directly by simul- 
taneous viewing because that tends to 
introduce comparison of retinal images 
instead of comparison of subjective 
sizes. 

Finally, we have to define and estab- 
lish a measure of subjective distance. 
For this purpose we note that the 
transformation from physical to visual 
space is cubic or isometric at short 
distances such as the normal viewing 
distance 6 and that the depth dimen- 
sion becomes. perceptively com- 
pressed at greater distances. Accord- 
ingly, the same measuring stick used 
for perceived size may also be used 
as the measuring stick for perceived 
distance; in each case it should be 
held at the normal viewing distance 6. 

A physical cube will be perceived 
as a cube if viewed at the normal 
viewing distance 5. At more remote 
distances the perceived thickness or 
depth will diminish more rapidly than 
the frontal dimensions. The receding 
solid will cease to appear as a cube in 
visual space unless it is physically 
elongated as it recedes to increasing 
distance from the observer. The sun 
and the moon lose their spherical 
thickness and appear as flattened 
disks against the sky. 

A line in physical space may be bi- 
sected. A line in visual space may be 
bisected. The two division points are 
not necessarily identical. If the line 
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recedes from the eye, as a distance, the 
two divisions will be different. 

A yardstick in physical space may 
be moved to different points or posi- 
tions without changing its physical 
length, but its visual length will 
change. A yardstick in visual space 
may be moved to different points or 
positions without changing its visual 
length, but its physical length would 
have to be changed. The constant 
physical yardstick is used for measur- 
ing physical lengths. The constant 
visual yardstick is used for measuring 
visual lengths (perceived sizes and 
distances). 

In order to correlate the two yard- 
sticks, we define the visual yardstick 
as the perceived size of the physical 
yardstick held at a convenient normal 
viewing distance 4 (delta) from the eye. 
By this expedient we get around the 
difficulty of correlating two incom- 
mensurables—an objective magnitude 
and a subjective magnitude. 

To apply the visual yardstick to a 
distant object, it is assumed that the 
visual yardstick is moved to the dis- 
tant object without changing the 
visual length of the yardstick. 

A distance from the observer (Q) is 
measured physically by repeatedly 
applying the physical yardstick, 
thereby dividing the distance into 
physically equal divisions. The suc- 
cessive divisions will appear shorter 
as they recede from the observer. To 
measure the distance visually, the 
visual yardstick must be applied re- 
peatedly so as to divide the distance 
into equa! visual divisions. These 
successi. divisions will be longer as 
they < «cede from the observer. 

The meaning of perceived size s is 
thus linked to the meaning of per- 
ceived distance d. Visual spatial ex- 
tents are measured in all three di- 
mensions of visual space by one and 
the same subjective measuring rod or 





ALBERTA S. GILINSKY 


visual yardstick. (The physical 
length of this yardstick will vary 
with distance and direction. Only at 
the standard viewing distance 6 are 
the physical and visual yardsticks 
equal and invariant.) 

For the purpose of the present 
study, it is extremely important to 
draw a clear distinction between per- 
ceived distance and estimated dis- 
tance, corresponding to the similarly 
important distinction drawn by Her- 
ing and others (2) between perceived 
size and estimated size. Perceived 
distance is phenomenal or apparent 
distance, comprising exclusively the 
direct product of stimulation (visual 
and muscular). Estimated distance 
includes an intellectual correction of 
perceived distance, derived from past 
experience and training, to arrive at 
a more informed inference or judg- 
ment of true distance. Perceived 
distance is represented by the reduced 
magnitude d. Estimated distance is 
an attempt, conscious or unconscious, 
to estimate (from d) the true distance 
D. 

When a subject says that a distant 
object appears to be one mile away, 
he means that it appears as far away 
as an object known to be one mile 
away. That is an absolute judgment 
(based on past experience) and not a 
measure of perceived distance. For 
a consistent definition of perceived 
distance d, when we say that a per- 
ceived distance is 40 yards we mean 
that the distance is perceived (in a 
subjective scale of visual distance) as 
twice as big as a perceived distance 
of 20 yards, or 40 times as big as a 
perceived distance of one yard—40 
visual yardsticks laid end to end. 
The physical lengths of these visual 
yardsticks would increase progres- 
sively as they recede from the eye. 
Accordingly, the perceived distance 
of an object 100 yards away may be 
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only 30 or 40 yards, although we may 
have learned, by training or experi- 
ence, to judge it as 100 yards away. 
Similarly, the perceived distance of 
the horizon or the moon may be only 
50 yards away by this definition. 

For a consistent definition of per- 
ceived size s, the constant visual yard- 
stick must be applied (in imagination 
or projection) to the distant object at 
its perceived distance. O should be 
guided to concentrate on the apparent 
size of the distant object at the dis- 
tance at which O projects or visualizes 
the distant object. 


Imagine an arm, capable of indefinite 
extension, holding the visual yardstick, 
maintaining the apparent length of the 
stick unchanged as it moves through 
space to make possible a subjective super- 
position of the measuring stick upon the 
distant object, say, the moon. If the 
imaginary arm carries the measuring 
stick farther than the perceived distance 
of the moon, the resulting measurement 
of perceived size will be too big. If the 
distance to which the visual yardstick is 
carried falls short of the perceived dis- 
tance of the moon, the resulting measure- 
ment of perceived size will be too small 
(cf. 11, p. 237). The various answers 
given by different individuals to the 
question, How large is the moon?— 
answers which vary from } inch to 30 
feet—illustrate this strikingly. Instruc- 
tions to concentrate on the apparent 
size of the moon at its perceived distance 
yield equivalence matchings within a 
much narrower range, 4 to 20 inches. 
The variation which remains must be 
largely attributed to the fact that indi- 
viduals differ in their perception of 
spatial depth or distance, particularly 
when, as in this case, the actual distance 
is so great anc beyond the range of 
experience through the senses. 


PERCEPTION OF DEPTH 


In the perception of visual space 
the two essential elements are size of 
retinal image (or visual angle) ¢ (phi) 
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and the perception of depth or dis- 
tance d. Both retinal and muscular 
sensations are involved. 

In binocular vision the basic essen- 
tial fo: , ception of depth or distance 
is the convergence (or the horizontal 
disparity or parallax) y (gamma). The 
interocular or interpupillary distance 
provides the psychophysical base line 
for the geometry of binocular space 
perception. Although Luneburg re- 
fers to y sometimes as “horizontal 
disparity’’ and sometimes as “bipolar 
parallax,’ he writes: 


“The meaning of ¥ is clear: the angle 
subtended by the lines of sight at the 
point of convergence, P’’ (12, p. 13). 


In monocular vision the correspond- 
ing physiological cue is the muscular 
or kinaesthetic sensation of accommo- 
dation. 

In addition to these primary or basic 
cues of convergence and accommoda- 
tion, there is a variety of supplement- 
ary clues which aid the perception 
of spatial depth. These associated 
or auxiliary clues include linear per- 
spective, aerial perspective, interposi- 
tion, movement parallax, texture gradi- 
ent, light and shade, and apperceptions 
from past experience. 

Luneburg does not attempt a phy- 
siological explanation of the relation 
of size perception to convergence, but 
he does consider this relation to be a 
necessary hypothesis. “Our scale of 
size seems to contract with increasing 
values of y” (12, p. 103). 

We need not consider the specific 
physiological mechanism whereby 
depth or distance in visual space is 
perceived. Such discussion is not es- 
sential for the development of the 
main thesis of the present study. 


THE Basic FORMULAS 


The mathematical derivations of 
the two fundamental interrelated 
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formulas for perceived distance and 
perceived size are given in the Ap- 
pendices. All three methods of deri- 
vation yield the identical pair of 
basic formulas. 

The basic formula for perceived 
distance is: 

d A 
where d = perceived distance, D = 
true (physical) distance, and A = 
maximum limit of perceived distance 
for a given O under given conditions. 

In fact, the constant A is self-defined 
by the formula. For D = 0, d = 0; 


Physical Scale (AD) 
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The fact that the sky generally ap- 
pears like a flattened bowl indicates 
that A has a lower value for pure space 
perception (elevated vision without 
foreground) and a higher value for 
horizontal vision (with foreground), 
due to differences in clues and context 
(the ‘moon illusion’”’). 

The high relative compression of 
visual space into a limiting boundary 
compared with the corresponding 
infinite expanse of physical space is 
impressively illustrated by the foliow- 
ing numercial example (caiculated 
for a convenient assumed value of 
A = 100 ft.): 


Visual Scale (Ad) 


The first 100 ft. of D will be perceived as 50 ft. = 3A 
The next 200 ft. of D will be perceived as 25 ft.= 3A 
The next 400 ft. of D will be perceived as 12.5 ft. = 3} A 
The next 800 ft. of D will be perceived as 6.25 ft. = A 
The next 1600 ft. cf D will be perceived as 3.125 ft. = #4 A 

but for D = infinity, d= A. In Summation (limit) 

other words, A is the apparent dis- For D=infinity, d=100 ft.=A 


tance of objects at infinity. Per- 
ceived distance ¢ increases with true 
(physical) distance D; but the curve 
of d plotted on D, starting with the 
slope d/D = 1, flattens out and be- 
comes asymptotic to the limiting 
horizontal line, d = A. 

This limiting value, d = A, in the 
basic formula is given by the apparent 
distance of astronomical objects; it is 
also equal to the visual distance from 
the observer to the perspective hori- 
zon (vanishing point for parallel lines 
receding from the observer). 

The O constant or parameter A is 
a measure of the finite depth of visual 
space for a given O under given con- 
ditions. An O with a higher value of 
A has an expanded visual world with 
a wider or farther horizon. An O 
with a lower value of A has a more 
compressed or limited visual world 
with a closer sky and a closer horizon. 


Accordingly, distance discrimina- 
tion (Ad/AD) rapidly diminishes with 
distance. 

The larger the value of A, the more 
nearly perceived distance d approxi- 
mates true distance D. For A = 
infinity, perceived distance would be 
equal to true distance. 

The plotted functions representing 
equation (I) for several values of A 
are shown as a family of curves in Fig. 
1. Perceived distance d is plotted 
against true distance D, in the same 
units and to the same scale. The 
diagonal straight line forming the 
upper limiting boundary represents 
the function for true or undiminished 
distance perception. It is obtained 
from equation (I) with A = infinity. 
The curves below this boundary line 
express equation (I) with respective 
numerical values of A corresponding 
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to 50, 60, 70, 80, 90, 100, 120, and 200 
ft. Larger values of A shift the 
. position of the function upward to- 
ward the diagonal line for true dis- 
tance; smaller values of A shift the 
function downward toward minimum 
perception of distance, corresponding 
to the lack of adequate cues for dis- 
tance perception. 
The basic formula for perceived 
size is 
Bei. B 
S A+D’ 
where s = perceived size, S = sub- 
jective true size (perceived size at 
normal viewing distance 6,) and B = 
A+46. The terms A and D retain 
the definitions of equation (I). The 
correction term 6 (delta) is the dis- 
tance at which an object is viewed for 
subjective true size. This correction 
makes perceived size equal to true 
size (s = S) notat D = 0, but at some 
small value of D = 6. Thus, an ob- 
ject appears to be true size when it is 
presented at some normal viewing 


(ID) 
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distance (about 2 feet) in front of the 
observer. This constant 6 may vary 
with the O, also with different objects 
for the same O. This device permits 
us to measure perceived size s (at 
various distances) and subjective true 
size S in the same units and to the 
same scale. The subjective scale by 
which s and S are measured is not in- 
volved and need not be known or ex- 
pressed. We work only with the 
ratio s/S. Both are subjective mag- 
nitudes, and the scale is immaterial. 

Accordingly, the three distance 
terms (A, B, D) may be given in one 
set of units, and the two size terms 
(s and S) in another set of units. 

The plotted functions representing 
equation (II) are shown as a family 
of curves in Fig. 2. The ratio of per- 
ceived size to true size (s/S) is plotted 
as a function of real distance D. For 
A = infinity, the upper horizontal 
boundary line is obtained, represent- 
ing s/S = 1; this relation is size con- 
stancy. For A = 0, the bottom hy- 





8 


8 


Perceived Distance (d) in feet 





i 





i" 1 1 ‘. 1 





r 


Phy. 


Seer “ 
100 120 140 160 180 


80 
«4 Distance (D) in feet 


Fic. 1. Theoretical functions for perceived distance. The graphs of equation (I) showing 
perceived distance plotted against physical distance for various assumed values of A. For 
A = equation (I) yields the diagonal line representing veridical distance perception. Be- 
low this boundary line decreasing values of A displace the curves downward toward minimum 


distance perception. 
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perbolic boundary curve is obtained, 
representing s/S = 5/D; this is the 
iaw of retina! image. Both of these 
special cases or boundary laws are 
given by equation (11) upon substitut- 
ing the appropriate values of A (zero 
or infinity). Intermediate values of 
A (such as A = 50 to 200 feet) yield 
the intermediate curved graphs shown 
in Fig. 2. 

The single parameter A in both 
basic formulas thus governs the posi- 
tion of the respective functions be- 
tween the limiting boundary laws. 
For perceived distance, the magnitude 
of A determines the position of the 
visual distance function between the 
two extremes of d = D (or perceived 
distance identical with true distance) 
and d ‘= 0 (or greatly diminished dis- 
tance perception). In similar man- 
ner, the magnitude of A governs the 
position of the visual size function 
between the two boundary conditions 
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representing size constancy and the 
opposite extreme of perceived size 
proportional to and dependent only 
upon retinal image or visual angle. 

Accordingly, the magnitude of A is 
a simple, convenient, and direct 
measure or index of what has been 
termed by prior investigators ‘‘phe- 
nomenal regression to the real object’’ 
(16). A zero value of A represents 
zero “regression,” and an _ infinite 
value of A represents complete or 
one hundred per cent “regression.”’ 
An advantage of such use of A over 
other indices of “regression” is that it 
is a constant for a given O under given 
conditions over the entire range of an 
experiment, whereas other indices 
vary from observation to observation 
or.from point to point of a plotted 
experimental curve. 

The empirical value of A (for a 
given O under given conditions) may 
be determined by any two or more 
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Theoretical functions for perceived size with distance variant. 


The graphs of 


equation (II) for various assumed values of A. The perceived size is shown as ratios to the 
true size (perceived size of the stimulus object at a normal viewing distance, taken as 2 feet). 
The value of A governs the position of the functions between and inclusive of the horizontal 
solid line representing the law of size constancy and the bottom curve representing the law of the 


retinal image or visual angle. 


The horizontal dashed line cuts the curves at s = 45S, indicating 


the distance at which this ratio may be expected for different values of A. 
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observations at different distances, or- 
by a single observation of an astronom- 
ical object. (See Appendix IV.) 

Instead of concentrating upon the 
effect of varying the parameter A, we 
may also study equation (II) with 
reference to the effect of varying the 
distance D. For D = 5, s = S, and 
the object is seen as ‘‘true size.”” As 
the object is brought closer to the 
eye, perceived size s increases some- 
what; and when the object is moved 
away from the eye, perceived size 
diminishes. (Luneburg’s derived for- 
mulas yield the anomalous result of 
making perceived size diminish steeply 
when the object is brought close to the 
eye. See his Fig. 41.) 


In further reference to equation 
(11), for small values of the distance 
D (relative to A), s/S is close to unity, 
corresponding to the law of size con- 
constancy; and for very large distances 
D (relative to A), s/S is greatly di- 


minished and varies inversely with D, 
corresponding to the law of retinal 
image. 

The basic formulas (I and II) are 
thus seen to express a law intermedi- 
ate between size constancy and retinal 
image while yielding these two ex- 
treme conditions at short viewing dis- 
tances and at astronomical distances, 
respectively. 

In the following sections the two 
basic formulas (I and II) will be ap- 
plied to recorded results of prior ex- 
periments and to new experiments in- 
volving observations of perceived 
size and distance. These data will 
provide an initial empirical test of 
the validity and practical usefulness 
of the present theoretical formulation. 


EXPERIMENTAL CONFIRMATION 


Perceived Size. A number of pre- 
viously recorded experiments, in which 
the subject adjusts the size of a com- 
parison stimulus at a constant dis- 
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tance to match a standard stimulus 
at a variable distance, yield results 
which correspond neither to the law of 
retinal image nor to the law of «ize 
constancy. 

Instead, beginning with the first 
systematic experiment on perceived 
size (Martius, 1889), the size matches 
consistently follow an intermediate 
law (2, 8, 9, 11, 16). The dependence 
of perceived size upon distance is 
found to be in good agreement with 
the prediction expressed by equation 
(II) for perceived size. The slope of 
the function relating the adjusted size 
of the comparison object and the dis- 
tance of the standard obiect is affected 
by controlling circumstances (4, 8, 9, 
18). These variations are reflected 
by appropriate changes in the magni- 
tude of the parameter A in equation 
(II). 

The most adequate investigation oi 
this problem was performed by Hol- 
way and Boring (9). Their experi- 
ment measured perceived size at dis- 
tances up to 120 feet, and also showed 
how the resulting functions shifted 
with the reduction of the available 
cues to distance. 

O's task was to match a controlled 
standard disk at distances varying 
from 10 feet to 120 feet. The size of 
the standard disk was made propor- 
tional to the distance so that it always 
subtended an angle of one degree, 
thus presenting constant retinal image 
or visual angle. The varying appar- 
ent size of this controlled standard 
disk was measured by means of a 
variable comparison disk which re- 
mained always at 10 feet from O. 

The results for two sets of condi- 
tions, binocular regard and direct 
monucular regard, for Boring himself 
as O, are shown in Fig. 3. Binocular 
vision gives a function close to the 
ascending straight-line function cor- 
responding to size constancy. Re- 
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duction to monocular vision lowered 
the function nearer to the horizontal 
slope corresponding to perceived size 
dependent only upon retinal image. 
Both of the curves drawn through the 
data points are given by equation 
(II) for perceived size. 

For the upper curve, obtained for 
binocular vision, the parameter A is 
243 feet; for the lower curve, obtained 
for monocular vision, the parameter 
A is 132 feet. The value of A is re- 
duced when the perceptive cues are 
reduced. For both equations, B is 
conveniently taken as equal to A plus 
10 feet, 10 feet being the distance of 
the comparison object. The same 
formula (equation I1) applies to both 
binocular and monocular vision, with 
only a change in the parameter A. 

From these results we learn that the 
value of A is, in fact, governed by the 
available cues to distance. The value 
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of A determines the position of the 
intermediate function (expressed by 
equation II) between the two ex- 
tremes, size constancy and retinal 
image. The constant A is thus 
demonstrated to be a sensitive index 
or measure of “phenomenal regres- 
sion” and of the conditions which 
affect its attainment. 

Perceived Distance by Equal-A ppar- 
ent Intervals. In connection with this 
study two pilot experiments were per- 
formed, dealing with the relation of 
perceived distance d to physical dis- 
tance D. Both experiments were de- 
signed to serve as an initial test of 
equation (I) for perceived distance, 
and as a guide to more refined experi- 
mentation. 

Experiment I used a modified 
method of equal appearing intervals. 
O stood at one end of an unfamiliar 
indoor archery range, about 80 feet 
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Fic. 3. Perceived size as a function of distance for binocular and monocular observation. 
The data are from Holway and Boring (9) with their own correction for space error. Equation 
(II) fits both sets of observations with only a change in the parameter A. As the conditions 
are altered from direct monocular regard to direct binocular regard, the parameter A has a 
higher value, shifting the function upward to approach the top diagonal line for size constancy. 
This line rises because they increased the size of the standard in proportion to the distance to 


keep visual angle constant. 
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Fic. 4. Perceived distance as a function of physical distance (Exp. 1). The circles repre- 
sent successive equal apparent increments of distance from 0 to 10 subjective feet, and from 20 


to 40 subjecti- « feet. 


At 10 subjective feet the O was told to indicate the distance that ap- 


peared twice a: great, thus determining the physical magnitude to be assigned to 20 subjective 


feet. 
sents veridical distance perception. 


ong, and directed the experimenter, 
who moved a pointer stick at a slow 
and nearly constant rate along the 
ground away from O, to mark off 
successive increments of equal per- 
ceived length. Each successive in- 
crement of perceived distance was an 
attempt to match, by non-simultane- 
ous viewing, a memorized ‘subjective 
foot rule’’ in the case of one observer 
and a memorized ‘subjective meter 
stick’’ in the other. Successive divi- 
sion points were temporarily marked 
by a horizontal rod marker which 
was moved as soon as each next step 
had been attained. 

The results for the two Os are 
shown in Figs. 4 and 5. In order to 
illustrate the manner in which the 
data are analyzed, Table I presents 
for one observer the cumulative meas- 
ured distance D in physical units for 
each value of subjective distance d; 
the calculated value of A for each 


The data lie on the curve of equation (I) with A = 94 feet. 


The diagonal line repre- 


separate observation (A= Dd/(D—d), 
Appendix IV, equation 5); and the 
theoretical value of D from equa- 
tion (1), using the weighted mean 
value of A determined from these 
data. It is significant to note the 
experimental constancy of A over the 
series of observations, particularly 
toward the larger values of D; also 
the correspondence between the meas- 
ured and the calculated values of D. 
(For smaller distances, small experi- 
mental deviations are reflected by 
larger variations in the calculated 
value of A.) 

In Figs. 4 and 5, perceived distance 
d is plotted against physical distance 
D in the same units and to the same 
scale. The diagonal straight line is 
the theoretical function for perceived 
distance identical with true measured 
distance. The obtained points lie on 
the tangent curves given by equation 
(I). The agreement between ob- 
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tained and theoretical values is strik- 
ing. 

Perceived Size of Astronomical Ob- 
jects. An independent determination 
of the constant A was made for the 
two Os of Experiment I by means of 
direct binocular observation of the 
perceived size of an astronomical 
object, the full moon. 

Using the method of constant 
stimuli in two categories, the observer 
was presented with a series of card- 
board disks ranging from 4 inches to 
30 inches in diameter. The O was 
instructed tc hold each disk at normal 
viewing distance and, by regarding it 
alternately with the moon (which was 
slightly above the horizon), to judge 
the disk as larger or smaller than the 
moon. 

Ten judgments by the O were ob- 
tained for each disk within the transi- 
tion zone, and the stimulus which re- 
ceived an equal number of “larger” 
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and “smaller than’ judgments was 
determined for each observer. The 
results of this procedure are as follows: 


For O “J. B.” the apparent size of 
the moon is s = 9.4 inches = 0.78 
ft.; and by Appendix IV, equation 
(2), A = 87 ft. 

For O “A. G.”’ the apparent size of 
the moon is s = 11 inches = 0.92 ft., 


_and hence A = 102 ft. 


The values of A thus determined are 
in reasonably close agreement with 
the values of A (about 94 ft.) obtained 
from Experiment I on perceived dis- 
tance. 

Perceived Distances by Fractiona- 
tion. In the second of our pilot ex- 
periments, Experiment II, the method 
of fractionation (bisection) was em- 
ployed in order to construct a ratio 
scale of visual distance and to deter- 
mine the relation of visual distance to 
the stimulus magnitude over a greater 
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Fic. 5. Perceived distance as a function of physical distance (Experiment I). 


The circles 


represent successive equal apparent increments of distance from O to 14 subjective meters. 
The meter was preferred by this O as the more familiar subjective measuring rod. The data 
(also shown in Table I) lie on the curve of equation (I) with A = 28.5 meters (about 94 feet) 
and agree closely with the results of the other O of Experiment I. 
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range than was previously investi- 
gated. It was thought that this 
method would provide a more logi- 
cally defensible test of the theoretical 
function (equation I) and would also 
allow a direct comparison with the 
procedure adopted by Stevens and 
Volkmann (15), and Reese (14) for 
constructing a subjective magnitude 
function. 

O's task was to bisect each one of 
fourteen distances, ranging from 8 
feet to 200 feet, on a large flat lawn. 
A line was stretched across the lawn, 
perpendicular to O who stood a few 
inches away from the end of the 
anchored lire so that she could see 
all of the line. Bisections were per- 
formed by stopping a pointer, which 
moved back and forth along the line, 
at a point which appeared to be half- 
way between the near end of the line 


TaBLe I 


OBSERVED AND THEORETICAL VALUES OF 
PuysicaL DISTANCE AND THE 
CALCULATED VALUE OF A 
FOR EacH OBSERVATION 
OF PERCEIVED DISTANCE 


(Data from O:J. B., Exp. I) 








Calculated 
A =Dd/ Eq. I, with 
Observed (D-d) a =28.5 
Physical 
D (meters) 


Perceived 
d (meters) 





Cale. D 


A (meters) (meters) 





1.04 
2.15 
3.35 
4.65 
6.05 
7.60 


1.08 14 
3.09 5 
4.12 il 
5.12 18 
6.10 28 
7.59 29 
9.39 27 9.28 
11.08 29 11.10 
13.10 29 13.14 
15.34 29 15.40 
17.60 29 17.91 
20.37 29 20.74 
23.56 29 23.94 
27.96 28 27.52 


Coonauwf WH 














Weighted Mean A = 28.5 
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TaB_e II 


OBSERVED AND THEORETICAL VALUES OF 
Har Distance (D,) ror Eacu 
STANDARD (D;:) FOR THE Two 
Os or EXPERIMENT II 
(Calculations from equation (2) using the 
respective weighted mean values of A) 








Observer BS Observer DM 





Dn Calc 
) Di (feet) ( 


Cale 
Di (feet) ( (fi 





3.67 
4.58 
5.00 
7.83 
8.50 
11.67 
14.83 
19.58 
29.58 
35.50 
48.08 
54.25 
66.00 
70.00 


4.00 

5.15 

6.63 

7.89 
10.06 
13.67 
17.25 
21.59 
27.33 
34.71 
41.15 
$1.98 
61.52 
64.54 

















and a marker designating the total dis- 
tance to be bisected. Four sets of 
half-distance observations were ob- 
tained, the standard distances being 
presented in serial order, alternately 
ascending and descending. Each bi- 
section was marked with a golf tee, 
invisible to the observer, and the 
judged half-distances were measured 
and the tees removed between series. 
Two naive Os took part in the experi- 
ment. 

The results of this procedure (for 
each QO) are shown in Table II, and 
the half-judgment function (for the 
combined data) is plotted as illus- 
trated in Fig. 6. From such half-judg- 
ment plots, subjective magnitude 
functions may be constructed accord- 
ing to the somewhat cumbersome pro- 
cedure described in detail by Reese 
(14). For the present problem, the 
identical functions may be obtained 
by a much simpler method, by utiliz- 
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ing a convenient property of the 
basic equation (I). 

From equation (I), the half-judg- 
ment function must satisfy the rela- 
tion 

__GD)4 | 
‘A+ (4D) 
Equation (I) may also be written in 
the form 


D (2) 


DA 
“A+D 
These two functions, equations (2) 
and (3), are obviously identical. The 
relation of D, to $D; is identical with 
the relation ofdtoD. The first is the 
half-magnitude function, and the sec- 
ond is the subjective magnitude func- 
tion. This simple relation may be 
used to obtain a d curve from < D; 
curve or a D, curve from a d curve. 
By simply using a double horizontal 
scale, a single plotted curve thus 
serves a dual purpose. 

Figure 7 graphically shows this 
convenient identifying relation be- 


d (3) 


tween the subjective scale of distance 
d and the half-judgment curve D, re- 
plotted from Fig. 6. Dy, is the same 
curve as d but with the horizontal 
scale doubled. In Fig. 7, locate the 
ordinate d = D, = 22 feet. The dis- 
tance D, = 22 from the D, curve was 
judged to be half of D, = 50 feet. 
Halving this abscissa yields D = 25 
feet and, consequently, for d = 22, 
D = 25. Accordingly, the experi- 
mental curve of Fig. 6 becomes di- 
rectly a subjective magnitude function 
by simply using the upper horizontal 
scale D instead of the lower Dz scale, 
and by reading the ordinate as d in 
place of D,. 

The points marked by triangles on 
the subjective magnitude function of 
Fig.' 7 are determined by the con- 
ventional method (14) by arbitrarily 
assigning one subjective foot to one 
physical foot and then proceeding by 
interpolation of the half-judgment 
function to discover the successive 
physical magnitudes to which the suc- 
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D or Dz = Physical Distance in feet 


Fic. 6. The mean bisection data of Experiment II are plotted on the curve of equation 
(2) in the text showing the half-judgment function, or, using the upper horizontal scale (D) and 
reading d in place of D, on the vertical scale, the subjective scale of visual distance. The in- 
clined dashed line shows the objective half distance and objective full distance for the lower 
and upper horizontal scales, respectively. See text and Fig. 7 for explanation. 
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© Half - Judgment Function 


@ Subjective Distonce Scale 
from Eqs. (2) and (3). 
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Dz = Physical Distance in feet 


Fic. 7. The magnitude function for subjective distance erected from the half-judgment 
function by two different procedures. The construction lines show how the magnitude function 
is obtained from the curve fitted to the mean bisection data by simply halving the horizontal 


scale, thus utilizing a convenient property of the pesent formulation. 


The identical function 


is obtained by the conventional method (14); see triangles. 


cessively doubled numerals (2, 4, 8, 
16, 32, 64) should be assigned. This 
method yields only seven points from 
these data, whereas the present more 
expeditious and direct method allows 
us to use all of the observed points 
and hence produces a more reliable 
curve. The function obtained by 
both methods is identical.. The vali- 
dity of the basic equation (I) is thus 
doubly confirmed. 


SUMMARY 


A general unifying law of visual 
space perception is shown to be cap- 
able of rational derivation and quanti- 


tative formulation. The derived 
basic formulas for perceived size and 
distance as functions of true size and 
distance are tested and confirmed by 
application to recorded data of ante- 
cedent experiments and to new ob- 
servations of perceived size and dis- 
tance. The excellent agreement be- 
tween observed and calculated values 


indicates that these formulas are valid 
not only for the primary case of pure 
space perception, but also for the 
more general case of visual space per- 
ception (binocular or monocular) with 
any enrichment or diminution of 
visual and related cues. 

The concept of a maximum limit 
of perceived distance yields a single 
parameter, A, in the basic formulas 
whose magnitude governs the interme- 
diate position of the functions between 
the boundary conditions expressed by 
size constancy and retinal image, re- 
spectively. The numerical value of this 
parameter, A, for a given observer is 
shown to depend upon the available 
perceptual cues to distance, and, 
hence, provides a convenient direct 
index or measure of ‘phenomenal re- 
gression.” 

The next indicated study is a 
planned series of experiments to deter- 
mine the conditions and laws of vari- 
ation of the parameter A. The work- 


+ lt 
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ing definitions and formulas developed 
here, and the data thus far analyzed, 
offer a starting point and a guide for 
further research in space perception. 


APPENDIX | 


Derivation of Formulas from 
Luneburg’s Basic Metric of 
Visual Space 

The basic metric of visual space, as 
formulated and established by Lune- 
burg (12, Eq. 3.781), is 


ds? =d (d)*+ M? (dy*+cos*y-d6), (1) 


where 


‘ 


d = visual (perceived) distance, 
$s = visual (perceived) size, 

¢ = horizontal angle (azimuth), 
6 = vertical angle (altitude), 
M = linear size factor = ds/dg. 


Luneburg makes this basic metric his 
point of departure for transformations 
into elliptic, Euclidean, and hyper- 
bolic space geometries, respectively. 
With the same point of departure, we 
shall make our Euclidean transforma- 
tion with the following consistent sub- 
stitutions: 


Me=d. (Hence ds = d-d¢.) 


True (objective) distance = D = 
a/y, where 


a = interpupillary distance, 
y = angle of binocular convergence. 








Perceived distance = d = Coes 
y+u 
Hence 
‘ a-dy 
d (d) =— ‘ 
O-- G+ 


(uw (mu) = aconstant added toy. It 
limits maximum perceived d, keeping 
d finite when + is zero or impercepti- 
bly small. In other words, » = the 
minimum or threshold value of per- 
ceived convergence or horizontal dis- 
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parity in viewing very distant ob- 
jects.) 

Substituting M and d (d) in the 
basic metric, equation (1), 
ey 
(y + wu)? \(y + uw)? 


+ cost: dt), (2) 


+d¢ 





ds* 


dy 
Note the form of the term age aR. 





This is a logarithmic form, correspond- 
ing to the Weber-Fechner law. Sen- 
sitiveness to dy diminishes with in- 
creasing y. Also note: Vision is 
1/(y + mw) times as sensitive to dy as 
to d g or d @. Thus the visual dis- 
tance factor is 1/(y + wu) times the 
visual size factor. 

The transformed metric, equation 
(2), contains two constants (a, yu) 
varying with the observer. 

By equation (2), with d g = 0 and 
dé@= 0, 


d =a/(y + u). 
But 
D=a/y. 
Hence, 
d OY a/p A 





D~ (y+u) a/uta/y  A+D’ 
with 
A= a/p . 


(Two observer constants are thus re- 


placed by one.) 
Similarly, by equation (2) with dy 
= 0 and dé = 0, 


ds=M-dg=d-dg, or s=d-¢. 


But objective true size Sp (to the same 
scale as D) is given by 


dS) = D-dg, or So = D-¢. 


Hence 
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Let o (sigma) = A/B, where 
B=A +4, 


and 6 (delta) = D at normal viewing 
distance. Then, by definition, sub- 
jective true size S is given by equation 
(3) as 
S A A 
S245 73" * 
From equations (3) and (4), 


s Bd _ id B 5 

S"AD~cD-~Aa+D © 
Note: d is perceived distance, to the 
same scale as visual size s. (yg = 
s/d.) 

D is measured (objective) distance, 
to the same scale as measured (ob- 
jective) size Sy. (g = S,/D.) 

A, B, D, and d are given in the same 


units. 
So, S, and s are given in the same 


units. 

We have thus derived two simple 
formulas, for perceived distance and 
perceived size, respectively: 


(I) 


é 
D 


5 
S”A+D 


APPENDIX II 


Derivation of Formulas from 
Principles of Perspective 

The basic geometry of vision is rep- 
resented in Fig. Ail, and the basic 
geometry of perspective is shown in 
Fig. A2. 

Referring to Fig. A2, the compress- 
ion of distance in perspective corre- 
sponds to the compression of perceived 
distance in visual space; and the per- 
spective reduction of size (width or 
height) between parallel lines reced- 
ing from the observer corresponds to 
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the reduction of perceived size in 
visual space. The parallel horizontal 
lines receding from the observer are 
seen in perspective as converging to 
a “vanishing point’’ in the horizon 
line. These proportional relations 
common to perspective and visual 
space are shown in Fig. A2. (VP is 
the “vanishing point’; DP is the 
“division point.”” From the objective 
plan below the ground line, the geo- 
metrical perspective is constructed 
above the ground line as shown.) 


In Figs. Al and A2, 


D = True (objective) distance; d 
= Perceived distance. 

So = True (objective) size; s = 
Perceived size. 

A = Horizon distance = Apparent 
distance of vanishing points 
in the horizon. 


By Fig. Al, the visual angle ¢ at 
the nodal point of the eye is given by 


g= = 


Hence, 
fg: 
} eee 
By similar triangles in Fig. A2, 
s A-d 
So A 
From equations (1) and (2), 
a _A-d 
D A 
This reduces to 
Oi By. 
D A+D 
Hence, by equation (1), 
FSi. 
So A+t+D 


(1) 


(2) 
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By definition of the proportionality 
constant ¢ (sigma), 


S A 
c= = 


So B 
From equations (4) and (5), 


Sees on 
S”~A+D 
The two basic formulas (I) and 
(II), thus written directly from the 
geometry of perspective, are identical 
with the two basic formulas derived 
rigorously in Appendix I from Lune- 
burg’s more theoretical metric of 
visual space. 


(5) 


(I) 


AppENprIx III 


Derivation of Basic Formula by 
Composition of the Laws of Size 
Constancy and Visual Angle 


It is known that the law of per- 
ceived size is intermediate between 
the law of retinal image (visual angle) 
and the law of size constancy (2, 9). 
This intermediate relationship has 
been called “regression to the real 
object” (16). 

The law of retinal image is ex- 
pressed by 
6 
D’ (1) 


wie 


Nodal point 





d 
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where 6 (delta) is the distance at 
which an object appears ‘“‘true size” 
(s = S). This would make the per- 
ceived size s of an object vary inversely 
with its distance D. 

The law of size constancy is ex- 
pressed by 
A 


a 
oi 
This would make s = S, independent 
of distance. 

By composition of equations (1) 
and (2), combining the two extreme 
laws into a single generalized inter- 
mediate law, we have 


(2) 


Ss A+é 
S"A4D (i) 


The value of A (or the ratio of A:D) 
determines the relative position of the 
interpolated law (II) between the two 
extremes, (1) and (2). Compare the 
Thouless (16) coefficient. For A = 
0, equation (II) becomes the law of 
retinal image. For A = infinity, 
equation (II) becomes the law of size 
constancy. For A = Dior D = A), 
equation (II) gives a value exactly 
midway between the two limiting 
laws, (1) and (2). f 

The magnitude of A (between zero 
and infinity) is thus a measure of the 


So 
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Geometry of visual space. 
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Fic. A2. Perspective geometry. 


For 
For 


“regression to the real object.” 
A =0, the regression is zero. 
A = infinity, the regression is com- 


plete. For A = D, the departure is 
ialf-way. The difference between 
retinal image and size constancy is 
divided into two parts by the per- 
ceived size; the ratio of the two parts 
is A:D. 

At small values of D (compared to 
A), the perceived size, by equation 
(11), is close to size constancy; the 
difference is so small as to be missed 
by investigators. 

At very large values of D (com- 
pared to A), the perceived size, by 
equation (II), is close to the retinal 
image law. The sun and the moon 
appear nearly the same size because 


their retinal images are nearly the 
same size, although their real sizes are 
880,000 miles and 2160 miles, re- 
spectively, a ratio of 407 to 1. 

Equation (II) is the basic formula 
for visually perceived- size, identical 
with the basic formula derived in the 
preceding Appendices. It is a gen- 
eralized, unifying expression, em- 
bracing the laws of size constancy and 
retinal image as special or limiting 
cases and bridging the gap between 
those two extreme laws. 


APPENDIX IV 
Experimental Determination of the 
Visual Constant A 

The visual constant A for a given O 
under given conditions may be deter- 
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mined experimentally in a variety of 
ways, either from a single size or dis- 
tance observation of an object of 
known physical size and distance, or 
from two or more related size or dis- 
tance observations. A few of the 
convenient, practical methods are 
outlined below: 


1. From a size observation of an 
astronomical object. The size of reti- 
nal image or the subtended visual 
angle ¢ (phi) for any object is given by 


So Ss 

9 eRe> (1) 
For an astronomical object, the physi- 
cal distance D is enormous or practi- 
cally infinite relative to A. For this 
condition, the basic distance formula 
(I) yields d = A. Hence 


A=d=~. (2) 


Thus A is given directly if Sp and D 
are known and s is observed. 
Thus, for the sun, 


So 880,000 miles 


? = D ~ 92,000,000 miles 
= 0.0096 radians. 





If the apparent size of the sun is 


s = 15 in. = 1.25 ft., 
then 
$ 1.25 ft. 
A =d=— = 90006 ~ 130 ft. 


Similarly, for the moon, 


So in 2160 miles 
?"D ~ 244,000 miles 
= 0.0090 radians. 


If the apparent size of the moon is 


s = 14in. = 1.17 ft., 


then 
s 1.17 ft. 


A=né=~ = 00000 





= 130 ft. 


Sn id Cale a cee 
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Note that the sun and the moon sub- 
tend nearly equal visual angles ¢, that 
of the sun being somewhat larger. 
Because both objects are perceived at 
the same distance (d = A), the appar- 
ent sizes are proportional to the re- 
spective visual angles. At astrono- 
nomical distances, visual size more 
nearly follows the law of the retinal 
image. 

2. From a size observation of a non- 
astronomical object. By the basic 
formula (II) for visual size, 


ye. Be 
&-244D' 
where r is the reduction ratio for visual 


size s from physical size Sp. Solving 
equation (3) for A, 


(3) 


T= 


Tr 
a—*) 


This gives A directly from a single 
observation when physical distance 
D, physical size So, and perceived 
size s are known for any given object. 

3. From perceived distance of a non- 
astronomical object. The basic for- 
mula for visual distance is 


d A 





A= -D- (4) 


D~A+D @ 
Solving for A, 
D-d 
A= Soa (5) 


This gives A directly from a single 
distance-observation for any object, 
when physical distance D and visual 
distance d are given. 

4. From two size-observations of the 
same object. Let s, be the perceived 
size of a given object at measured dis- 
tance D,, and s; the perceived size of 
the same object at measured distance 
D;. By the basic formula (II) for 
visual size, 





(6a) 
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Fic. A3. Perceived size and perceived distance. 


Yah. Ber 
S A+D, 
From equations (6a) and (6b), 


Sy A+D, 
an (7) 


Solving equation (7) for A, 
Ds. — D 
se (8) 


Sa = $s 


(6b) 


This gives A directly when the two 
physical distances, D,; and D2, and 
the two respective perceived sizes, 51 
and 52, of a given object are known. 

5. From a half-distance observation. 
Let D, be the measured distance of 
the apparent (visually perceived) mid- 
point of the measured distance Dz. 
By the basic formula (I), the half- 
distance observation is expressed by 
the pair of equations: 


A-D,; 


a=77D, 


(9a) 


A-D, 


d, = 2d, = a+ Be (9b) 


Solving this pair of equations for A 


D:D; 
D; — 2D; 


This gives A directly when any 
measured distance D, and the appar- 
ent hali-distance D, are given. 

6. From a series of observations in a 
visual size or distance experiment. As 
a rule, a larger number of observations 
make up a size or distance experi- 
ment. In such case, the best mean 
value of A is obtained as a weighted 
mean of the values of A computed for 
the individual observations (using 
the respective distances D as the 
weight factors). Other convenient 
methods of curve-fitting or short-cut 
applications of the Theory of “east 
Squares may be used to determine the 
best mean value of A to fit the series 
of observations. 

After A is determined, B is given 
by B = A + 4, where 4 (delta) is the 
normal viewing distance for the ob- 
server, usually 1 or 2 ft. 

7. From coupled observations of per- 
ceived size and perceived distance. 


A= (10) 
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From two or more observations of a 
given object viewed at different dis- 
tances, the recorded values of per- 
ceived size may be plotted directly 
against the corresponding values of 
perceived distance. The resulting 
graph (Fig. A3) is a straight line, in- 
clined downward toward the right. 


The relation is given by 
BA-d 
2 a 


s/S = B/A. For 


§ 
S 
For d = 0, 


s/S = 0, d =A. Accordingly the 
straight line graph intercepts the 
vertical axis at B/A and the horizon- 
tal axis at A. The two parameters, 
A and B, are thus given directly by the 
plotted graph. 
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